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Description 

Technical Field 

5 [0001] This invention is in the fields of genetic engineering, plant biology, and bacteriology. 
Background Art 

[0002] In the past decade, the science of genetic engineering has developed rapidly. A variety of processes are 

10 known for inserting a heterologous gene into bacteria, whereby the bacteria become capable of efficient expression 
of the inserted genes. Such processes normally involve the use of plasmids which may be cleaved at one or more 
selected cleavage sites by restriction endonucleases. discussed below. Typically, a gene of interest is obtained by 
cleaving one piece of DNA and the resulting DNA fragment is mixed with a fragment obtained by cleaving a vector 
such as a plasmid. The different strands of DNA are then connected ("ligated") to each other to form a reconstituted 

IS plasmid. See, for example, U.S. Patents 4.237.224 (Cohen and Boyer, 1980); 4.264,731 (Shine. 1981); 4.273,875 
(Manis, 1981); 4,322,499 (Baxter et al, 1982). and 4,336,336 (Silhavy et al. 1 982). A variety of other reference works 
are also available. Some of these works describe the natural processes whereby DNA is transcribed into messenger 
RNA (mRNA) and mRNA is translated Into protein; see, e.g., Stryer, 1981 (note: all references cited herein, other than 
patents, are listed with citations after the Examples); Lehninger. 1975. Other works describe methods and products of 

20 genetic manipulation; see. e.g.. Manlatis et al, 1982; Setlow and Hollaender, 1979. 

[0003] Most of the genetic engineering work performed to date involves the insertion of genes into various types of 
cells, primarily bacteria such as E. coli. various other types of microorganisms such as yeast, and mammalian cells. 
However, many of the techniques and substances used for genetic engineering of animal cells and microorganisms 
are not directly applicable to genetic engineering involving plants. 

2S [0004] As used herein, the term "plant" refers to a multicellular differentiated organism that Is capable of photosyn- 
thesis, such as angiosperms and multicellular algae. This does not include microorganisms, such as bacteria, yeast, 
and fungi. However the term "plant cells" includes any cell derived from a plant; this includes undifferentiated tissue 
such as callus or crown gall tumor, as well as plant seeds, propagules. pollen; and plant embryos. 
[0005] A variety of plant genes have been isolated, some of which have been published and/or are publicly available. 

30 Such genes Include the soybean actin gene (Shah el at 1 982), corn zein (Pederson et al, 1 982) soybean leghemoglobin 
(Hyldlg-Nielsen et al, 1982), and soybean storage proteins (Fischer and Goldberg, 1982). 

The ReQlons of a Gene 

35 [0006] The expression of a gene involves the creation of a polypeptide which is coded for by the gene. This process 
involves at least two steps: part of the gene is transcribed to form messenger RNA. and part of the mRNA is translated 
into a polypeptide. Although the processes of transcription and translation are not fully understood, it is believed that 
the transcription of a DNA sequence into mRNA is controlled by several regions of DNA. Each region is a series of 
bases (i.e., a series of nucleotide residues comprising adenosine (A), thymidine (T), cytidine (C), and guanidine (G)) 

40 which are in a desired sequence. Regions which are usually present in a eucaryotic gene are shown on Figure 1 . These 
regions have been assigned names for use herein, and are briefly discussed below. It should be noted that a variety 
of terms are used in the literature, which describes these regions in much more detail. 

[0007] An association region 2 causes RNA polymerase to associate with the segment of DNA. Transcription does 
not occur at association region 2; instead, the RNA polymerase normally travels along an Inten/ening region 4 for an 
45 appropriate distance, such as about 100-300 bases, after it is activated by association region 2. 

[0008] A transcription initiation seguence 6 directs the RNA polymerase to begin synthesis of mRNA. After it recog- 
nizes the appropriate signal, the RNA polymerase is believed to begin the synthesis of mRNA an appropriate distance, 
such as about 20 to about 30 bases, beyond the transcription initiation sequence 6. This is represented in Figure 1 by 
intervening region 8. 

so [0009] The foregoing sequences are referred to collectively as the promoter region of the gene. 

[0010] The next sequence of DNA is transcribed by RNA polymerase into messenger RNA which is not translated 
into protein. In general, the 5' end of a strand of mRNA attaches to a ribosome. In bacterial cells, this attachment is 
facilitated by a sequence of bases called a "ribosome binding site" (RBS). However, in eucaryotic cells, no such RBS 
sequence is known to exist. Regardless of whether an RBS exists in a strand of mRNA, the mRNA moves through the 

55 ribosome until a "start codon" is encountered. The start codon is usually the series of three bases, AUG; rarely the 
codon GUG may cause the initiation of translation. The non-translated portion of mRNA located between the 5' end of 
the mRNA and the start codon is referred to as the 5] non-translated region 10 of the mRNA. The corresponding 
sequence in the DNA Is also referred to herein as 5; non-translated region 12. The specific series of bases in this 
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sequence is not believed to be of great importance to the expression of the gene; however, the presence of a premature 
start codon might affect the translation of the mRNA (see Kozak, 1978). 

[0011] A promoter sequence may be significantly more complex than described above; for example, certain promot- 
ers present in bacteria contain regulatory sequences that are often referred to as "operators." Such complex promoters 

s may contain one or more sequences which are Involved in induction or repression of the gene. One example is the lac 
operon, which normally does not promote transcription of certain lactoseutilizing enzymes unless lactose Is present in 
the cell. Another example is the trp operator, which does not promote transcription or translation of certain tryptophan- 
creating enzymes if an excess of tryptophan is present in the cell. See, e.g., Miller and Reznikoff, 1982. 
[0012] The next sequence of bases Is usually called the coding sequence or the structural sequence 14 (in the DNA 

10 molecule) or 16 (in the mRNA molecule). As mentioned above, the translation of a polypeptide begins when the mRNA 
start codon, usually AUG, reaches the translation mechanism in the ribosome. The start codon directs the ribosome 
to begin connecting a series of amino acids to each other by peptide bonds to form a polypeptide, starting with me- 
thionine, which always forms the amino terminal end of the polypeptide (the methionine residue may be subsequently 
removed from the polypeptide by other enzymes). The bases which follow the AUG start codon are divided into sets 

IS of 3. each of which is a codon. The "reading frame", which specifies how the bases are grouped together into sets of 
3, is determined by the start codon. Each codon codes for the addition of a specific amino acid to the polypeptide being 
formed. The entire genetic code (there are 64 different codons) has been solved; see, e.g., Lehninger, supra, at p. 
962. For example, CUA is the codon for the amino acid leucine; GGU specifies glycine, and UGU specifies cysteine. 
[0013] Three of the codons (UAA, UAG, and UGA) are "stop" codons; when a stop codon reaches the translation 

20 mechanism of a ribosome, the polypeptide that was being formed disengages from the ribosome, and the last preceding 
amino acid residue becomes the carboxy terminal end of the polypeptide. 

[0014] The region of mRNA which is located on the 3' side of a stop codon in a monocistronic gene is referred to 
herein as 3; non-translated region 18. This region 18 is believed to be involved in the processing, stability, and/or 
transport of the mRNA after it is transcribed. This region 18 is also believed to contain a sequence of bases, polv- 
25 adenylation signal 20, which Is recognized by an enzyme in the cell. This enzyme adds a substantial number of ade- 
nosine residues to the mRNA molecule, to form poly-A tail 22. 

[0015] The DNA molecule has a 3; non-translated region 24 and a poiv-adenvlation signal 26, which code for the 
corresponding mRNA region 1 8 and signal 20. However, the DNA molecule does not have a poly-A tail. Poly-adenylation 
signals 20 (mRNA) and 26 (DNA) are represented in the figures by a heavy dot. 

30 

Gene-Host Incompatibllitv 

[0016] The same genetic code is utilized by all living organisms on Earth. Plants, animals, and microorganisms all 
utilize the same correspondence between codons and amino acids. However, the genetic code applies only to the 
35 structural sequence of a gene, i.e., the segment of mRNA bounded by one start codon and one stop codon which 
codes for the translation of mRNA Into polypeptides. 

[0017] However, a gene which performs efficiently In one type of cell may not perform at all in a different type of cell. 
For example, a gene which is expressed in E, coll may be transferred into a different type of bacterial cell, a fungus, 
or a yeast. However, the gene might not be expressed in the new host cell. There are numerous reasons why an intact 
40 gene which is expressed in one type of cell might not be expressed in a different type of cell. See, e.g., Sakaguchi and 
Okanishi, 1981. Such reasons include: 

1. the gene might not be replicated or stably inherited by the progeny of the new host cell. 

2. the gene might be broken apart by restriction endonucleases or other enzymes In the new host cell. 

45 3. the promoter region of the gene might not be recognized by the RNA polymerases in the new host cell. 

4. one or more regions of the gene might be bound by a repressor protein or other molecule In the new host cell, 
because of a DNA region which resembles an operator or other regulatory sequence of the host's DNA. For ex- 
ample, the lac operon includes a polypeptide which binds to a particular sequence of bases next to thejac promoter 
unless the polypeptide is itself inactivated by lactose. See, e.g., M\\\er and Reznikoff, 1 982. 

50 5. one or more regions of the gene might be deleted, reorganized, or relocated to a different part of the host's 

genome. For example, numerous procaryotic cells are known to contain enzymes which promote genetic recom- 
bination; (such as the rec proteins In E, colj; see, e.g., Shibata et al, 1979) and transposition (see, e.g., The 45th 
Cold Spring Harbor Symposium on Quantitative Biology, 1 981 ). In addition, naturally-occurring genetic modificatton 
can be enhanced by regions of homology between different strands of DNA; see, e.g., Radding, 1978. 

55 6. mRNA transcribed from the gene may suffer from a variety of problems. For example, it might be degraded 

before it reaches the ribosome, or it might not be poly-adenylated or transported to the ribosome, or it might not 
interact properly with the ribosome, or it might contain an essential sequence which is deleted by RNA processing 
enzymes. 
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7. the polypeptide which is created by translation of the mRNA coded tor by the gene may suffer from a variety of 
problems. For example, the polypeptide may have a toxic effect on the cell, or it may be glycosylated or converted 
into an altered polypeptide, or it may be cleaved Into shorter polypeptides or amino acids, or it nnay be sequestered 
within an intracellular compartment where it is not functional. 

5 

[0018] In general, the likelihood of a foreign gene being expressed in a cell tends to be lower if the new host cell is 
substantially different from the natural host cell. For example, a gene from a certain species of bacteria is likely to be 
expressed by other species of bacteria within the same genus. The gene is less likely to be expressed by bacteria of 
a different genus, and even less likely to be expressed by non-bacterial microorganisms such as yeast, fungus, or 
10 algae. It is very unlikely that a gene from a cell of one kingdom (the three kingdoms are plants, animals, and "protista" 
(microorganisms)) could be expressed in cells from either other kingdom. 

[0019] These and other problems have, until now. thwarted efforts to obtain expression of foreign genes into plant 
cells. For example, several research teams have reported the insertion of foreign DNA into plant cells; see, e.g. , Lurquin, 
1979; Krens et al, 1982; Davey et al, 1980. At least three teams of researchers have reported the insertion of entire 
IS genes into plant cells. By use of radioactive DNA probes, these researchers have reported that the foreign genes (or 
at least portions thereof) were stably inherited by the descendants of the plant cells. See Hernalsteens et al. 1980; 
Garfinkel et al, 1 981 ; Matzke and Chilton, 1 981 . However, there was no reported evidence that the foreign genes were 
expressed in the plant cells. 

[0020] Several natural exceptions to the gene-host incompatibility barriers have been discovered. For example, sev- 
20 eral E. coH genes can be expressed in certain types of yeast cells, and vice-versa. See Beggs, 1 978; Struhl et al, 1 979. 
[0021] In addition, certain types of bacterial cells, including AQrobacterium tumefaciens and A. rhizogenes, are ca- 
pable of infecting various types of plant cells, causing plant diseases such as crown gall tumor and hairy root disease. 
These Agrobacterium cells carry plasmids, designated as Tl plasmids and Ri plasmids, which carry genes which are 
expressed in plant cells. Certain of these genes code for enzymes which create substances called 'opines," such as 
25 octopine, nopaline, and agropine. Opines are utilized by the bacteria cells as sources of carbon, nitrogen, and energy 
See, e.g., Petit and Tempe, 1978. The opine genes are believed to be inactive while in the bacterial cells; these genes 
are expressed only after they enter the plant cells. 

[0022] In addition, a variety of man-made efforts have been reported to overcome one or more of the gene-host 
incompatibility barriers. For example, it has been reported that a mammalian polypeptide which is normally degraded 

30 within a bacterial host can be protected from degradation by coupling the mammalian polypeptide to a bacterial polypep- 
tide that normally exists in the host cell. This creates a 'lusion protein;" see, e.g., Itakura et al, 1977. As another 
example, in order to avoid cleavage of an inserted gene by endonucleases in the host cell, it is possible to either (1) 
insert the gene into host cells which are deficient in one or more endonucleases, or (2) duplicate the gene in cells 
which cause the gene to be methylated. See, e.g., Maniatis et al, 1981. 

35 [0023] In addition, various efforts to overcome gene-host incompatibility barriers involve chimerk; genes. For exam- 
ple, a structural sequence which codes for a mammalian polypeptide, such as insulin, interferon, or growth hormone, 
may be coupled to regulatory sequences from a bacterial gene. The resulting chimeric gene may be inserted into 
bacterial cells, where it will express the mammalian polypeptide. See, e.g., Guarente et al, 1980. Alternately, structural 
sequences from several bacterial genes have been coupled to regulatory sequences from viruses which are capable 

40 of infecting mamnnalian cells. The resulting chimeric genes were inserted into mammalian cells, where they reportedly 
expressed the bacterial polypeptkje. See, e.g.. Southern and Berg, 1982; Colbere-Garapin etal, 1982. 

Restriction Endonucleases 

45 [0024] In general, an endonuclease is an enzyme which is capable of breaking DNA into segments of DNA. An endo 

nuclease Is capable of attaching to a strand of DNA somewhere In the middle of the strand, and breaking it. By com- 
parison, an exo nuclease removes nucleotides from the end of a strand of DNA. All of the endonucleases discussed 
herein are capable of breaking double-stranded DNA into segments. This may require the breakage of two types of 
bonds: (1) covalent bonds between phosphate groups and deoxyribose residues, and (2) hydrogen bonds (A-T and 
50 c-G) which hold the two strands of DNA to each other. 

[0025] A "restriction endonuclease" (hereafter referred to as an endonuclease) breaks a segment of DNA at a precise 
sequence of bases. For example, EcoRI and Haelll recognize and cleave the following sequences: 
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ECORI: 5*« 




XXG ^ 
YYCTTAA 



AAITCXX 
GYY 



10 



Haelll: GG:C 
CC3G 



75 



XXGC 
YYCC 



CCXX 
GGYY 



20 



25 



[0026] In the examples cited above, the EcoRI cleavage created a "cohesive" end with a 5' overtiang (I.e., the single- 
stranded "tail" has a 5' end rather than a 3" end). Cohesive ends can be useful In promoting desired ligations. For 
example, an EcoRI end is more likely to anneal to another EcoRI end than to a Haelll end. 

[0027] Over 1 00 different endonucleases are known, each of which is capable of cleaving DNA at specific sequences. 
See, e.g., Roberts, 1982. All restriction endonucleases are sensitive to the sequence of bases. In addition, some 
endonucleases are sensitive to whether certain bases have been methylated. For example, two endonucleases. Mbol 
and SauSa are capable of cleaving the folbwing sequence of bases as shown: 



30 




S'-XX 

YYCTAG 



GATCXX 
YY 



35 



40 



[0028] Mbol cannot cleave this sequence if the adenine residue is methylated (me-A). SauSa can cleave this se- 
quence, regardless of whether either A is methylated. To some extent the methylation (and therefore the cleavage) of 
a plasmid may be controlled by replicating the plasmids in cells with desired methylation capabilities. AnE.coli enzyme. 
DNA adenine methylase (dam), methylates the A residues that occur in GATC sequences. Strains of E. colt which do 
not contain the dam enzyme are designated as dam-cells. Cells which contain dam are designated as dam + or dam 
cells. 

[0029] Several endonucleases are known which cleave different sequences, but which create cohesive ends which 
are fully compatible with cohesive ends created by other endonucleases. For example, at least five different endonu- 
cleases create 5' GATC overhangs, as shown in Table 1 . 
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Table 1 

^ Endonucleaae Sequence 

Mbol 

^0 ZzUiibited by me*A 

Sau3a same as Mbol 

IS Unaffected by me^A 

B9IIZ 

20 unaffected by me«A 

BCII 

^5 Inhibited by me-A 






BamHI 

30 

unaffected by me-A 

[0030] A cohesive end created by any of the enconucleases listed in Table 1 will ligate preferentially to a cohesive 
35 end created by any of the other endonucleases. However, a ligation of. for example, a Bglll end with a BamHI end will 
create the following sequence: 




AGATCC 
TCTACC 

[0031] This sequence cannot be cleaved by either Bgl II or BamHI; however, it can be cleaved by Mbol (unless 
methylated) or by Sau3a. 

45 [0032] Another endonuclease which involves the GATC sequence is Pvul, which creates a 3* overhang, as folbws: 



so 



CGAjCG 
FACC 



[0033] Another endonuclease, Clal, cleaves the following sequence: 
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5 




[0034] If is G, or if X2 is C, then tlie sequence may be cleaved by Mbol (unless methylated, in which case Clal is 
also inhibited) or SauSa. 

10 

Viral Promoters 

[0035] A virus is a microorganism comprising single or double stranded nucleic acid (DNA or RNA) contained within 
a protein (and possibly lipid) shell called a "capsid" or "coat". A virus Is smaller than a cell, and it does not contain most 
IS of the components and substances necessary to conduct most biochemical processes. Instead, a virus infects a cell 
and uses the cellular processes to reproduce itself. 

[0036] The following is a simplified description of how a DNA-containing virus infects a cell; RNA viruses will be 
disregarded in this introduction for the sake of clarity. First, a virus attached to or enters a cell; normally called a "host" 
cell. The DNA from the virus (and possible the entire viral particle) enters the host ceil where it usually operates as a 

20 plasmid (a loop of extra-chromosomal DNA). The viral DNA is transcribed into messenger RNA, which is translated 
into one or more polypeptides. Some of these polypeptides are assembled into new capsids, while others act as en- 
zymes to catalyze various biochemical reactions. The viral DNA is also replicated and assembled with the capsid 
polypeptides to form new viral particles. These viral particles may be released gradually by the host cell, or they may 
cause the host cell to lyse and release them. The released viral particles subsequently infect new host cells. For more 

25 background information on viruses see, e.g., Stryer. 1981 and Matthews, 1970. 

[0037] As used herein, the term "virus" includes phages and viroids, as well as repllcative intermediates. As used 
herein, the phrases "viral nucleic acid" and "DNA or RNA derived from a virus" are construed broadly to include any 
DNA or RNA that is obtained or derived from the nucleic acid of a virus. For example, a DNA strand created by using 
a viral RNA strand as a template, or by chemical synthesis to create a known sequence of bases determined by 

30 analyzing viral DNA, would be regarded as viral nucleic acid. 

[0038] The host range of any virus (i.e. , the variety of cells that a type of virus is capable of infecting) is limited. Some 
viruses are capable of efficient infection of only certain types of bacteria; other viruses can infect only plants, and may 
be limited to certain genera; some viruses can infect only mammalian cells. Viral infection of a cell requires more than 
mere entry of the viral DNA or RNA into the host cell; viral particles must be reproduced within the cell. Through various 

35 assays, those skilled in the art can readily determine whether any particular type of virus is capable of infecting any 
particular genus, species, or strain of cells. As used herein, the term "plant virus" is used to designate a virus which is 
capable of infecting one or more types of plant cells, regardless of whether it can infect other types of cells. 
[0039] With the possible exception of viroids (which are poorly understood at present), every viral particle must 
contain at least one gene which can be "expressed" in infected host cells. The expression of a gene requires that a 

40 segment of DNA or RNA must be transcribed into or function as a strand of messenger RNA (mRNA), and the mRNA 
must be translated into a polypeptide. Most viruses have about 5 to 10 different genes, all of which are expressed in 
a suitable host cell. 

[0040] Promoters from viral genes have been utilized in a variety of genetic engineering applications. For example, 
chimeric genes have been constructed using various structural sequences (also called coding sequences) taken from 

45 bacterial genes, coupled to promoters taken from viruses which can infect mammalian cells (the most commonly used 
mammalian viruses are designated as Simian Virus 40 (SV40) and Herpes Simplex Virus (HSV)). These chimeric 
genes have been used to transform mammalian cells. See, e.g., Mulligan et at 1979; Southern and Berg 1982. In 
addition, chimeric genes using promoters taken from viruses which can infect bacterial cells have been used to trans- 
form bacterial cells; see, e.g., the phage lambda PL promoter discussed in Maniatis et al, 1982. 

so [0041] Several researchers have theorized that it might be possible to utilize plant viruses as vectors for transforming 
plant cells. See, e.g., Hohn et al, 1982. In general, a "vector" is a DNA molecule useful for transferring one or nriore 
genes Into a cell. Usually, a desired gene is inserted into a vector, and the vector is then used to infect the host cell. 
[0042] Several researchers have theorized that It might be possible to create chimeric genes which are capable of 
being expressed in plant cells, by using promoters derived from plant virus genes. See, e.g., Hohn etal, 1982, at page 

55 216. 

[0043] However, despite the efforts of numerous research teams, prior to this invention no one had succeeded In (1 ) 
creating a chimeric gene comprising a plant virus promoter coupled to a heterologous structural sequence and (2) 
demonstrating the expression of such a gene in any type of plant cell. 
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Cauliflower Mosaic Virus (CaMV^ 

[0044] The entire DNA sequence of CaMV has been published Gardner et al, 1981: Hohn et al, 1982. In its most 
common form, the CaMV genome is about 8000 bp long. However, various naturally occurring infective mutants which 
have deleted about 500 bp have been discovered; see Howarth et al 1981. The entire CaMV genome is transcribed 
into a single mRNA, with a sedimentation coefficient of 32S. The promoter for the 32S mRNA is located in the large 
intergenic region about 1 kb countercloclcwise from Gap 1 (see Guilley et al. 1982). 

[0045] CaMV is believed to generate at least eight proteins; the corresponding genes are designated as Genes I 
through VIII. Gene VI is transcribed into mRNA with a sedimentation coefficient of 195. The 1 9S mRNA is transcribed 
into a protein designated as P66. which is an inclusion body protein. The 19S mRNA is promoted by the 19S promoter, 
located about 2.5 kb counterclockwise of Gap 1. 

SUMMARY OF THE INVENTION 

[0046] This invention relates to chimeric genes which are capable of being expressed in plant cells, and to a method 
for creating such genes. 

[0047] The chimeric gene comprises a promoter region as specified according to the invention which is capable of 
causing RNA polymerase in a plant cell to create messenger RNA corresponding to the DNA. 
[0048] The chimeric gene also contains a sequence of bases which codes for a 5' non -translated region of mRNA 
which is capable of enabling or increasing the expression in a plant cell of a structural sequence of the mRNA. For 
example, a suitable 5' non-translated region may be taken from the NOS gene, from a plant virus gene, or from a gene 
which exists naturally in plant cells. 

[0049] The chimeric gene also contains a desired structural sequence, i.e., a sequence which is transcribed into 
mRNA which is capable of being translated into a desired polypeptide. The structural sequence is heterologous with 
respect to the promoter region, and it may code for any desired polypeptide, such as a bacterial or mammalian protein. 
The structural sequence includes a start codon and a stop codon. The structural sequence may contain introns which 
are removed from the mRNA prior to translation. 

[0050] The chimeric gene also contains a DNA sequence which codes for a 3' non-translated region (Including a 
poly-adenylation signal) of mRNA. This region may be derived from a gene which Is naturally expressed in plant cells, 
to help ensure proper expression of the structural sequence. Such genes include the NOS gene, plant virus genes, 
and genes which exist naturally in plant cells. 

[0051] The method of this invention is described below, and is summarized in the flow chart of Figure 2. 
[0052] If properly assembled and inserted into a plant genome, a chimeric gene of this invention will be expressed 
in the plant cell to create a desired polypeptide, such as a mammalian hormone, or a bacterial enzyme which confers 
antibiotic or herbicide resistance upon the plant. 

Brief Description of the Drawings 

[0053] The figures herein are schematic representations; they have not been drawn to scale. 
40 [0054] Figure 1 represents the structure of a typical eukaryotlc gene. 

[0055] Figure 2 is a flow chart representing the steps of this invention, correlated with an example chimeric NOS- 
NPTII-NOS gene, this specific gene is, however, not claimed herein. 

[0056] Figure 3 represents fragment Hindlll-23, obtained by digesting a Ti plasmid with Hindlll. 
[0057] Figure 4 represents a DNA fragment which contains a NOS promoter region, a NOS 5' non -translated region, 
45 and the first few codons of the NOS structural sequence. 

[0058] Figure 5 represents the cleavage of a DNA sequence at a precise locatbn, to obtain a DNA fragment which 
contains a NOS promoter region and complete 5' non -translated region. 

[0059] Figure 6 represents the creation of plasmids pMONIOOl and pMON40, which contain an NPTII structural 

sequence. 

so [0060] Figure? represents the insertion of a NOS promoter region into plasmid pMON40. to obtain pMON58. 

[0061] Figure 8 represents the creation of an M13 derivative designated as M-2, which contains a NOS 3' non- 
translated region and poly-A signal. 

[0062] Figure 9 represents the assembly of the NOS-NPTIl-NOS chimeric gene, and the insertion of the chimeric 
gene into plasmid pMON38 to obtain plasmids pMON75 and pMON76. 
55 [0063] Figure 10 represents the insertion of the NOS-NPTIl-NOS chimeric gene into plasmid pMON120 to obtain 
plasmids pMON128 and pMON129. 

[0064] Figure 11 represents the creation of plasmid pMON66, which contains an NPTl gene. 

[0065] Figure 12 represents the creation of plasmid pMON73, containing a chimeric NOS-NPTII sequence. 
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[0066] Figure 13 represents the creation of plasmid pMON78, containing a chimeric NOS-NPTI sequence. 
[0067] Figure 14 represents the creation of plasmids pMONIOS and pMON107, which contain chimeric NOS-NPTI- 
NOS genes. 

[0068] Figure 15 represents the insertion of a chimeric NOS-NPTI-NOS gene Into pMON120 to obtain plasmids 
5 pMON130andpMON131. 

[0069] Figure 16 represents the structure of a DNA fragment containing a soybean protein (sbss) promoter. 
[0070] Figure 17 represents the creation of plasmid pMON121 , containing the sbss promoter. 
[0071] Figure 18 represents the Insertion of a chimeric sbss-NPTII-NOS gene into pMON120 to create plasmids 
pMON141 and pMON142. 

10 [0072] Figure 19 represents the creation of plasmid pMONIOB. containing a bovine growth hormone structural se- 
quence and a NOS 3' region. 

[0073] Figure 20 represents the creation of plasmid N25-BGH, which contains the BGH-NOS sequence surrounded 
by selected cleavage sites. 

[0074] Figure 21 represents the Insertion of a chimeric sbss-BGH-NOS gene Into pMON120 to obtain plasmids 
IS pMON147 and pMON148. 

[0075] Figure 22 represents the creation of plasmid pMON149, which contains a chimeric NOS-BGH-NOS gene. 
This specific gene is not claimed herein. 

[0076] Figure 23 represents the creation of plasmid pMONB, which contains a structural sequence for EPSP syn- 
thase. 

20 [0077] Figure 24 represents the creation of plasmid pMON25, which contains an EPSP synthase structural sequence 
with several cleavage sites near the start codon. 

[0078] Figure 25 represents the creation of plasmid pMON146, which contains a chimeric sequence comprising 
EPSP synthase and a NOS 3' region. 

[0079] Figure 26 represents the Insertion of a chimeric NOS-EPSP-NOS gene into pMON120 to obtain plasmid 
25 pMON153. This specific gene is not claimed herein. 

[0080] Figure 27 represents the creation of plasmid pMON154, which contains a chimeric sbss-EPSP-NOS gene. 
This specific gene is not claimed herein. 

[0081] Figure 28 represents the creation and structure of plasmid pMON93, which contains a CaMV 1 9S promoter 

[0OQ2] Figure 29 represents the creation and structure of plasmid pMONI 56, which contains a chimeric CaMV-(1 9S)- 
30 NPT-NOS gene. This specific gene is not claimed herein. 

[0083] Figure 30 represents the creation and structure of plasmid pMONHO, which contains a partial NPT gene. 

[0084] Figure 31 represents the creation and structure of plasmid pMONI 32, which contains a partial NPT-NOS gene. 

[0OQ5] Figure 32 represents the creation and structure of plasmid pMONI 55, which contains a chimeric CaMV-(1 9S)- 

NPT-NOS gene. This specific gene is not claimed herein. 
35 [0086] Figure 33 represents the creation and structure of plasmid pDMONSI , which contains a CaMV 32S promoter. 

[0087] Figure 34 represents the creation and structure of plasmid pMON125, which contains a CaMV 32S promoter 

[0088] Figure 35 represents the creation and structure of plasmid pMON172, which contains a CaMV 32S promoter. 

[0089] Figure 36 represents the creation and structure of phage M12, which contains a CaMV 32S promoter, 

[009Q] Figure 37 represents the creation and structure of plasmids pMONI 83 and pMONI 84, which contain chimeric 
40 CaMV(32S)-NPT-NOS genes. 

DETAILED DESCRIPTION OF THE INVENTION 

[0091] The Invention relates to a chimeric gene capable of expressing a neomycin phosphotransferase polypeptide 
45 in plant cells conferring antibiotic resistance to the plant when inserted Into the plant genome, comprising in sequence: 

(a) a promoter region from a ribulose-1,5-bis-phosphate carboxylase small subunit gene; 

(b) a 5' non-translated region; 

(c) a structural coding sequence encoding neomycin phosphotransferase I or 11; and 

so (d) a 3' non-translated region of a gene naturally expressed in plant cells, said region encoding a signal sequence 

for polyadenylatlon of mRNA; said promoter being heterologous with respect to the structural coding sequence, 
wherein 

the 3' non-translated region may be selected from a gene from the group consisting of the genes from the T-DNA region 
55 of Agrobacterium tumefaclens, and wherein 

the 3' non-translated region may be from the nopaline synthase gene of Agrobacterium tumefaciens. 
The invention further relates to a chimeric gene capable of expressing a polypeptide in plant cells comprising in se- 
quence: 
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(a) a full-length transcript promoter region isolated from cauliflower mosaic virus; 

(b) a 5' non-translated region; 

(c) a structural coding sequence; 

(d) a 3' non-translated region of a gene naturally expressed in plants, said region encoding a signal sequence for 
5 polyadenylatlon of mRNA, said structural coding sequence being heterologous with respect to said promoter re- 
gion, wherein 

the 3' non-translated region may be from a nopaline synthase gene. 

The invention also relates to a culture of microorganisms identified by ATCC accession number 39265. 

10 [0092] The method used to assemble this chimeric gene is summarized in the flow chart of Figure 2, and described 
in detail below and in the examples. To assist the reader in understanding the steps of this method, various plasmids 
and fragments involved in the NOS-NPTII-NOS chimeric gene are cited In parentheses in Figure 2. However, the 
method of Figure 2 is applicable to a wide variety of other plasmids and fragments. To further assist the reader, the 
steps shown in Figure 2 have been assigned callout numbers 42 et seq. These callout numbers are cited in the following 

IS description. The techniques and DNA sequences of this invention are likely to be useful in the transformation of a wide 
variety of plants, including and plant which may be infected by one or more strains of A. tumefaclens or A. rhizogenes. 

The NOS Promoter Region and 5' Non-translated Region 

20 [0093] The subsequent description provides information relevant to the invention in general. However, chimeric 
genes comprising the NOS promoter are not claimed herein. 

[0094] The Applicants decided to obtain and utilize a nopaline synthase (NOS) promoter region to control the ex- 
pression of the heterologous gene. The NOS is normally carried in certain types of T plasmids. such as PTiT37 (Sciaky 
et al. 1978). The NOS promoter is normally inactive while in an A. tumefaciens cell. The entire NOS gene, including 
2S the promoter and the protein coding sequence, is within the T-DNA portion of a Ti plasmic that is inserted into the 
chromosomes of plant cells when a plant becomes infected and forms a crown gall tumor Once inside the plant cell, 
the NOS promoter region directs RNA polymerase within a plant cell, to transcribe the NOS protein coding sequence 
into mRNA, which is subsequently translated into the NOS enzyme. 

[0095] The boundaries between the different parts of a promoter region (shown in Figure 1 as association region 2, 

30 intervening region 4, transcription initiation sequence 6, and intervening region 8), and the boundary between the 
promoter region and the 5' non -translated region, are not fully understood. The Applicants decided to utilize the entire 
promoter regbn and 5' non -translated region from the NOS gene, which is known to be expressed in plant cells. 
However, it is entirety possible that one or more of these sequences might be modified in various ways. Such as 
alteration in length or replacement by other sequences. Such modifications in promoter regions and 5' non-translated 

35 regions have been studied in bacterial cells (see, e.g., Roberts et al 1979) and mammalian cells (see, e.g., McKnight, 
1982). By utilizing the methodology taught by this invention, it is now possible to study the effects of modifications to 
promoter regions and 5' non-translated regions on the expression of genes in plant cells. It may be possible to increase 
the exoression of a gene in a plant cell by means of such modifications. Such modifications, if performed upon chimeric 
genes of this invention, are within the scooe of this invention. 

40 [0096] A nopaline-type tumor-Inducing plasmid, designated as pTiT37. was isolated from a strain of A. tumefaciens 
using standard procedures (Currier and Nester, 1976). It was digested with the endonuclease Hindlll which produced 
numerous fragments. These fragments were separated by size on a gel, and one of the fragments was isolated and 
removed from the gel. This fragment was designated as the Hindlll-23 fragment, because it was approximately the 
23rd largest fragment from the Ti plasmid; it is approximately 3400 base pairs (bp) in size, also referred to as 3.4 

4S kilobases (kb). From work by others (see, e.g., Hernalsteens et al, 1980), it was known that the Hlndlll-23 fragment 
contained the entire NOS gene, including the promoter region, a 5' non-translated region, a structural sequence with 
a start codon and a stop codon, and a 3' non-translated region. The Hindlll-23 fragment is shown in Figure 3. 
[0097] By means of varbus cleavage and sequencing experiments, it was determined that the Hindlll-23 fragment 
could be digested by another endonuclease, Sau3a, to yield a fragment, about 350 bp in size, which contains the entire 

so NOS-promoter region, the 5' non-translated region, and the first few codons of the NOS structural sequence. This 
fragment was sequenced, and the base sequence is represented in Figure 4. The start codon (ATG) of the NOS struc- 
tural sequence begins at base pair 301 within the 350 bp fragment. 

[0098] The Applicants decided to cleave the fragment between base pairs 300 and 301; this would provide them 
with a fragment about 300 base pairs long containing a NOS promoter region and the entire 5' non-translated region 
55 but with no translated bases. To cleave the 350 bp fragment at precisely the right location, the Applicants obtained an 

Ml 3 clone designated as SI A. and utilized the procedure described below. 

[0099] To create the SIA clone, Dr. Michael Bevan of Washington University converted the 350 bp Sau3a fragment 
into a single strand of DNA. This was done by utilizing a virus vector, designated as the M13 mp2 phage, which goes 



11 



EP0131 623 B2 



through both double-stranded (ds) and single-stranded (as) stages in its lite cycle (Messing et al, 1981). The ds 350 
bp fragment was Inserted into the double-stranded repllcative form DNA of the Ml 3 mp2, which had been cleaved with 
BamHI. The two fragments were ligated, and used to Infect E. ^1 cells. The ds DNA containing the 350 bp inserted 
fragment subsequently replicated, and one strand (the viral strand) was encapsulated by the Ml 3 viral capsid proteins. 
5 In one clone, designated the SI A. the orientation of the 350 bp fragment was such that the anti-sense strand (containing 
the same sequence as the mRN A) of the NOS gene was carried In the viral strand. Viral particles released from infected 
cells were isolated, and provided to the Applicants. 

[0100] Single stranded SIA DNA. containing the anti-sense 350 bp fragment with the NOS promoter region, was 
isolated from the viral particles and sequenced. A 14-mer oligonucleotide primer was synthesized, using published 
10 procedures (Beaucage and Carruthers, 1981. as modified by Adams et al, 1982). This 14-mer was designed to be 
complementary to bases 287 through 300 ot the 350 bp fragment, as shown on Figure 4. 

[0101] The 5' end of the synthetic primer was radloactlvely labelled with ^^P\ this Is represented In the figures by an 
asterisk. 

[0102] Copies of the primer were mixed with copies of the single-stranded SIA DNA containing the anti-sense strand 
15 of the 350 bp fragment. The primer annealed to the desired region of the SIA DNA, as shown at the top of Figure 5. 
After this occurred, Klenow DNA polymerase and a controlled quantity of unlabelled deoxynucleoside triphosphates 
(dNTP's), A, T, C, and G, were added. Klenow polymerase added nucleotides to the 3' (unlabelled) end of the primer, 
but not to the 5' (labelled) end. The result, as shown In Figure 5, was a circular loop of single-stranded DNA. part of 
which was matched by a second strand of DNA. The 5* end of the second strand was located opposite base #300 of 
20 the Sau3a insert. 

[0103] The partially double-stranded DNA was then digested by a third endonuclease, Haelll, which can cleave both 
single-stranded and double-stranded DNA. Haelll cleavage sites were known to exist in several locations outside the 
350 bp insert, but none existed inside the 350 bp Insert. This created a fragment having one blunt end, and one 3' 
overhang which started at base #301 of the Sau3a insert. 
25 [0104] The Haelll fragment mixture was treated with T4 DNA polymerase and unlabelled dNTP's. This caused the 
single stranded portion of the DNA, which extended from base #X1 of the Sau3a Insert to the closest Haelll cleavage 
site, to be removed from the fragment. In this manner, the ATG start codon was removed from base pair #300, leaving 
a blunt end double-stranded fragment which was approximately 550 bp long. 

[0105] The mixture was then digested by a fourth endonuclease EcoRl, which cleaved the 550 bp fragment at a 
30 single site outside the NOS promoter region. The fragments were then separated by size on a gel, and the radioactively- 
labelled fragment was isolated. This fragment contained the entire NOS promoter region and 5' non-translated region. 
It had one blunt end with a sequence of 

5'-.,.CTGCA 
. . •GACGT 



40 and one cohesive end (at the EcoRI site) with a sequence of 

5' AATTC- 



The shorter strand was about 308 bp long. 

[0106] The foregoing steps are represented in Figure 2 as steps 42, 44, and 46. 

[0107] This fragment was Inserted intopMON40 (which is described below) to obtain pMON58, as shown on Figure 7. 

so 

Creation of plasmid with NPT II gene (pMON40) 

[0108] A bacterial transposon, designated as Tn5, is known to contain a complete NPT 1 1 gene, including the promoter 
region, structural sequence, and 3' non-translated region. The NPT II enzyme inactivates certain aminoglycoside an- 
55 tiblotics, such as kanamycin, neomycin, and G418; see Jimenez and Davles, 1980. This gene is contained within a 
1.8 kb fragment, which can be obtained by digesting phage lambda bbkan-1 DNA (D. Berg et al. 1975) with two en- 
donucleases, Hindlll and BamHI. This fragment was inserted into a common laboratory plasmid, pBR327, which had 
been digested by Hindlll and BamHI. As shown in Figure 6. the resulting plasmid was designated as pMONI 001 , which 
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was about 4.7 kb. 

[0109] To reduce the size of the DNA fragment which carried the NPT II structural sequence, the Applicants eliminated 
about 500 bp from the pMONIOOl plasmid, in the following manner. First, they digested pMONlOOl at a unique Smal 
restriction site which was outside of the NPT II gene. Next, they inserted a 10-mer synthetic oligonucleotide linker, 

5 

5' CCGGATCCGC 
GGCCTAGGCC 

10 

into the Smal cleavage site. This eliminated the Smal cleavage site and replaced it with a BamHI cleavage site. A 
second BamHI cleavage site already existed, about 500 bp from the new BamHI site. The Applicants digested the 
plasmid with BamHI, separated the 500 bp fragment from the 4.2 kb fragment, and circularized the 4.2 kb fragment. 
The resulting plasmids were inserted into E. coli^ which were then selected for resistance to ampiclHin and kanamycin. 
J5 A clonal colony of E. coli was selected; these cells contained a plasmid which was designated as pMON40, as shown 
in Figure 6. 

[0110] The foregoing steps are represented in Figure 2 as steps 48 and 50. 
Insertion of NOS promoter into plasmid pMON40 

20 

[0111] The Applicants deleted the NPT II promoter from pMON40. and replaced it with the NOS promoter fragment 
described previously, by the following method, shown on Figure 7. 

[0112] Previous cleavage and sequencing experiments (Rao and Rogers, 1979; Auerswald et al, 1980) indicated 
that a Bglll cleavage site existed in the NPT II gene between the promoter region and the structural sequence. Plasmid 
25 pMON40 was digested with Bglll. The cohesive ends were then filled in by mixing the cleaved plasmid with Klenow 
polymerase and the four dNTP's, to obtain the following blunt ends: 



5' - AGATC CATCT- 

50 

- TCTAC CTAGA-5' 



The polymerase and dNTP's were removed, and the cleaved plasmid was then digested with EcoRI. The smaller 
55 fragment which contained the NPT 11 promoter region was removed, leaving a large fragment with one EcoRI end and 
one blunt end. This large fragment was mixed with the 308 bp fragment which contained the NOS promoter, described 
previously and shown on Figure 5. The fragments were ligated, and inserted into E. coli. E. coli clones were selected 
for ampiclllin resistance. Replacement of the NPT II promoter region (a bacterial promoter) with the NOS promoter 
region (which is believed to be active only in plant cells) caused the NPT 11 structural sequence to become inactive in 
40 E, coli, Plasmids from 36 kanamycin-sensltive clones were obtained; the plasmid from one clone, designated as 
pMON58, was utilized in subsequent work. 

[0113] The foregoing steps are represented In Figure 2 as steps 52 and 54. 

[0114] Plasmid pf^ON58 may be digested to obtain a 1 .3 kb EcoRI-BamHI fragment which contains the NOS promoter 
region, the NOS 5' non-translated region, and the NPT II structural sequence. This step is represented in Figure 2 as 
45 step 56. 

Insertion of NOS 3' sequence into NPT II gene 

[0115] As mentioned above in "Background Art", the functions of 3' non-translated regions in eucaryotic genes are 
so not fully understood. However, they are believed to contain at least one important sequence, a poly-adenylatlon signal. 
[0116] It was suspected by the Applicants that a gene having a bacterial 3' non-translated region might not be ex- 
pressed as effectively in a plant cell as the same gene having a 3' non-translated region from a gene, such as NOS, 
which is known to be expressed in plants. Therefore, the Applicants decided to add a NOS 3' non -translated region to 
the chimeric gene, in addition to the NPT II 3* non-translated region already present. Alternately. It is possible, using 
55 the methods described herein, to delete the NPT H or other existing 3' non-translated region and replace it with a 
desired 3" non-translated region that is known to be expressed In plant cells. Whether a different type of 3' non-translated 
region (such as a 3' region from an octopine-type or agropine-type Ti plasmid, or a 3' region from a gene that normally 
exists in a plant cell) would be suitable or preferable for use in any particular type of chimeric gene, for use In any 
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specific type of plant cell, may be deteimined by those skilled in the art through routine experimentation using the 
method of this invention. 

[0117] Those skilled in the art may also determine through routine experimentation whether the 3' non-translated 
region that naturally follows a structural sequence that is to be inserted into a plant cell will enhance the efficient 

s expression of that structural sequence in that type of plant cell. If so. then the steps required to insert a different 3' non- 
translated region into the chimeric gene might not be required In order to perform the method of this invention. 
[0118] In order to obtain a DNA fragment containing a NOS 3' non-translated region appropriate for joining to the 
NPT II structural sequence from pMON58 (described previously), the Applicants utilized a 3.4 kb Hindlll-23 fragment 
from a Ti plasmid, shown on Figure 3. This 34 kb fragment was isolated and digested with BamHI to obtain a 1.1 kb 

10 BamHI-Hindlll fragment containing a 3' portion of the NOS structural sequence (including the stop codon), and the 3' 
non -translated region of the NOS gene (including the poly-adenylation signal). This 1.1 kb fragment was inserted Into 
a pBR327 plasmid which had been digested with Hindlll and BamHI. The resulting plasmid was designated as pMON42, 
as shown on Figure 6. 

[0119] Plasmid pMON42 was digested with BamHI and Rsal, and a 720 bp fragment containing the desired NOS 3' 
IS non-translated region was purified on a gel. The 720 bp fragment was digested with another endonuclease, Mbol, and 
treated with the large fragment of E, coli DNA polymerase I. This resulted in a 260 bp fragment with Mbol blunt ends, 
containing a large part of the NOS 3' non-translated region including the poly-A signal. 

[0120] The foregoing procedure is represented in Figure 2 by step 58. However, it is recognized that alternate means 
could have been utilized; for example, it might have been possible to digest the Hindlll-23 fragment directly with Mbol 
20 to obtain the desired 260 bp fragment with the NOS 3' non-translated region. 

Assembly of Chimeric Gene 

[0121] To complete the assembly of the chimeric gene, it was necessary to ligate the 260 bp Mbol fragment (which 
25 contained the NOS 3' non-translated region) to the 1 .3 kb EcoRI-BamHI fragment from pMON58 (which contained the 
NOS promoter region and 5' non-translated region and the NPT II structural sequence). In order to facilitate this ligation 
and control the orientation of the fragments, the Applicants decided to convert the Mbol ends of the 260 bp fragment 
into a BamHI end (at the 5' end of the fragment) and an EcoRI end (at the 3' end of the fragment). In order to perform 
this step, the Applicants used the following method. 
30 [0122] The 260 bp Mbol fragment, the termini of which had been converted to blunt ends by Klenow polymerase, 
was inserted into M13 mp8 DNA at a Smal cleavage site. The Smal site is surrounded by a variety of other cleavage 
sites present in the M13 mp8 DNA, as shown in Figure 8. The Mbol fragment could be inserted into the blunt Smal 
ends in either orientation. The orientation of the Mbol fragments in different clones were tested, using Hinfl cleavage 
sites located assymetrically within the Mbol fragment. A clone was selected in which the 3' end of the NOS 3' non- 
55 translated region was located near the EcoRI cleavage site in the M13 mp8 DNA. This clone was designated as the 
M-2 clone, as shown in Figure 8. 

[0123] Replicative form (double stranded) DNA from the M-2 clone was digested by EcoRI and BamHI and a 280 
bp fragment was isolated. Separately, plasmid pMON58 was digested by EcoRI and BamHI, and a 1 300 bp fragment 
was isolated. The two fragments were ligated, as shown in Figure 9. to complete the assembly of a NOS-NPTII-NOS 

40 chimeric gene having EcoRI ends. 

[0124] There are a variety of ways to control the ligation of the two fragments. For example, the two EcoRI-BamHI 
fragments could be joined together with DNA ligase and cleaved with EcoRI. After inactivation of EcoRI, a vector 
molecule having EcoRI ends that were treated with calf alkaline phosphatase (CAP) may be added to the mixture. The 
fragments in the mixture may be ligated in a variety of orientations. The plasmid mixture is used to transform E, coli^ 

45 and cells having plasmids with the desired orientation are selected or screened, as described below. 

[0125] A plasmid, designated as pMON38, was created by insertion of the Hindlll-23 fragment (from Ti plasmid 
pTiT37) into the Hindlll cleavage site of the plasmid pBR327. Plasmid pMON38 contains a unique EcoRI site, and an 
ampicillin-resistance gene which is expressed in E. coli. Plasmid pMON38 was cleaved with EcoRI and treated with 
alkaline phosphatase to prevent it from re-ligaling to itself. U.S. Patent 4,264,731 (Shine, 1 981 ). The resulting fragment 

so was mixed with the 1 300 bp NOS-NPTII fragment from pMON58, and the 280 bp NOS fragment from M-2, which had 
been ligated and EcoRI -cleaved as described in the previous paragraph. The fragments were ligated, and inserted 
into E. coli. The E. coN cells which had acquired intact plasmids with ampicillin-resistance genes were selected on 
plates containing ampicillin. Several clones were selected, and the orientation of the inserted chimeric genes was 
evaluated by means of cleavage experiments. Two clones having plasmids carrying NOS-NPT ll-NOS inserts with 

ss opposite orientations were selected and designated as pMON75 and pMON76, as shown In Figure 9. The chimeric 
gene may be isolated by digesting either pMON75 or pMON76 with EcoRI and purifying a 1580 bp fragment. 
[0126] The foregoing procedure is represented on Figure 2 by step 60. 

[0127] This completes the discussion of the NOS-NPTII-NOS chimeric gene. Additional Information on the creation 
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of this gene is provided in the Examples. A copy of this chimeric gene is contained in plasmid pMON128; it may be 
removed from pMON128 by digestion with EcoRI. A culture of Ecoli containing pMON128 has been deposited with 
the American Type Culture Collectbn; this culture has been assigned accession number 39264. 
[0128] To prove the utility of this chimeric gene, the Applicants inserted it into plant cells. The NPTII structural se- 
5 quence was expressed in the plant cells, causing them and their descendants to acquire resistance to concentrations 
of kanamycin which are normally toxic to plant cells. 

Creation of NPT I Chimeric Gene 

10 [0129] A chimeric gene was created comprising (1 ) a NOS promoter region and 5' non-translated region, [2) a struc- 
tural sequence which codes for NPT I, and (3) a NOS 3* non-translated region. 

[0130] NPT I and NPT II are different and distinct enzymes with major differences In their amino acid sequences and 
substrate specificities. See, e.g., E. Beck et al, 1982. The relative stabilities and activities of these two enzymes in 
various types of plant cells are not yet fully understood, and NPT I may be preferable to NPT II for use In certain types 

is of experiments and plant transformations. 

[0131] A 1200 bp fragment containing an entire NPT I gene was obtained by digesting pACY177 (Chang and Cohen, 
1978) with the endonuclease. Avail. The Avail termini were converted to blunt ends with Klenow polymerase, and 
converted to BamHI termini using a synthetic linker. This fragment was inserted into a unique BamHI site In a 
pBR327-derived plasmid, as shown in Figure 11. The resulting plasmid was designated as pMON66. 

20 [0132] Plasmid pMON57 (a deletion derivative of pBR327, as shown in Figure 11 ) was digested with Avail. The 225 
bp fragment of pMON57 was replaced by the analogous 225 bp Avail fragment taken from plasmid pUC8 (Vieira and 
Messing, 1 982), to obtain a derivative of pMON57 with no PstI cleavage sites. This plasmid was designated as pMON67. 
[01 33] Plasmid pMON58 (described previously and shown in Figure 7) was digested with EcoRI and BamHI to obtain 
a 1300 bp fragment carrying the NOS promoter and the NPT II structural sequence. This fragment was inserted into 

25 pMON67 which had been digested with EcoRI and BamHI. The resulting plasmid was designated as pMON73, as 
shown in Figure 1 2. 

[0134] pMON73 was digested with PstI and BamHI, and a 2.4 kb fragment was isolated containing a NOS promoter 
region and 5' non -translated region. Plasmid pMON66 (shown on Figure 11) was digested with Xhol and BamHI to 
yield a 950 bp fragment containing the structural sequence of NPT I. This fragment lacked about 30 nucleotides at the 

30 5' end of the structural sequence. A synthetic linker containing the missing bases, having appropriate PstI and Xhol 
ends, was created. The pMON73 fragment, the pMON66 fragment, and the synthetic linker were llgated together to 
obtain plasmid pMON78, as shown in Figure 1 3. This plasmid contains the NOS promoter region and 5' non -translated 
region joined to the NPT I structural sequence. The ATG start codon was in the same position that the ATG start codon 
of the NOS structural sequence had occupied. 

35 [0135] Plasmid pMON78 was digested with EcoRI and BamHI to yield a 1300 bp fragment carrying the chimerk; 
NOS-NPT I regions. Doyle-stranded DNA from the M-2 clone (described previously and shown on Figure 9) was di- 
gested with EcoRI and BamHI. to yield a 280 bp fragment carrying a NOS 3' non-translated region with a poly-ade- 
nylatlon signal. The two fragments described above were llgated together to create the NOS-NPT l-NOS chimeric 
gene, which was inserted into plasmid pMON38 (described above) which had been digested with EcoRI. The two 

40 resulting plasmlds, having chimeric gene inserts with opposite orientations, were designated as pMON106 and 
pMON107. as shown in Figure 14. 

[0136] Either of plasmlds pMONIOS or pMON107 may be digested with EcoRI to yield a 1.6 kb fragment containing 
the chimeric NOS-NPT l-NOS gene. This fragment was inserted Into plasmid pMON120 which had been digested with 
EcoRI and treated with alkaline phosphatase. The resulting plasmlds, having inserts with opposite orientations, were 
45 designated as pMONI 30 and pMONI 31 , as shown on Figure 1 5. 

[0137] The NOS-NPT l-NOS chimeric gene was inserted into plant cells, which acquired resistance to kanamycin. 
This demonstrates expression of the chimeric gene In plant cells. 

Creation of Chimeric Gene with Soybean Promoter 

50 

[0138] In one embodiment of this invention, a chimeric gene was created comprising (1) a promoter region and 5' 
non -translated region taken from a gene which naturally exists in soybean; this gene codes for the small subunit of 
ribuiose-1 ,5-bis-phosphate carboxylase (sbss, for soybean small subunit); (2) a structural sequence which codes for 
NPT II. and (3) a NOS 3' non -translated region. 
ss [0139] The sbss gene codes for a protein in soybean leaves which is involved in photosynthetic carbon fixation. The 
sbss protein is the most abundant protein in soybean leaves (accounting for about 10% of the total leaf protein), so it 
is likely that the sbss promoter region causes prolific transcription. 

[0140] There are believed to be approximately six genes encoding the sbss protein In the soybean genome. One of 
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the members of the sbss gene family. SRS1, which is highly transcribed in soybean leaves, has been cloned and 
characterized. The promoter region. 5' nonlranslated region, and a portion of the structural sequence are contained 
on a 2.1 kb EcoRI fragment that was subcloned Into the EcoRI site of plasmid pBR325 (Bolivar, 1978). The resultant 
plasmid, pSRS2. 1 , was a gift to f^onsanto Company from Dr. R. 8. Meagher, University of Georgia, Athens, CA. The 
5 2.1 kb EcoRI fragment from pSRS2.1 Is shown on Figure 16. 

[0141] Plasmid pSRS2.1 was prepared from dam -E. coli cells, and cleaved with Mbol to obtain an 800 bp fragment. 
This fragment was inserted into plasmid pKC7 {Rao and Rogers. 1 979) which had been cleaved with Bgll I. The resulting 
plasmid was designated as pMON121, as shown on Figure 17. 

[0142] Plasmid pMON121 was digested with EcoRI and Bell, and a 1200 bp fragment containing the sbss promoter 
10 region was isolated. Separately plasmid pfy^ON75 (described previously and shown on Figure 9) was digested with 
EcoRI and Bgll I, and a 1250 bp fragment was isolated, containing a NPT II structural sequence and a NOS 3* non- 
translated region. The two fragments were ligated at the compatible Bcll/Bglll overhangs, to create a 2450 bp fragment 
containing sbss-NPT ll-NOS chimeric gene. This fragment was inserted into pl\/ION120 which had been cleaved with 
EcoRI, to create two plasmids having chimeric gene inserts with opposite orientations, as shown in Figure 18. The 
15 plasmids were designated as pMON141 and pMON142. 

[0143] The sbss-NPTII-NOS chimeric genes were inserted into several types of plant cells, causing the plant cells 
to acquire resistance to kanamycin. 

[0144] This successful transfornnation proved that a promoter region from one type of plant can cause the expression 
of a gene within plant cells from an entirely different genus, family and order of plants. 

20 [0145] The chimeric sbss-NPT ll-NOS gene also had another significant feature. Sequencing experiments indicated 
that the 800 bp hAbo\ fragment contained the ATG start codon of the sbss structural sequence. Rather than remove 
this start codon, the Applicants decided to insert a stop codon behind it in the same reading frame. This created a 
dicistronic mRNA sequence, which coded for a truncated amino portion of the sbss polypeptide and a complete NPT 
II polypeptide. Expression of the NPT It polypeptide was the first proof that a dicistronic mRNA can be translated within 

25 plant cells. 

[0146] The sbss promoter is contained in plasmid pMON154, described below. A culture of E, coli containing this 
plasmid has been deposited with the American Type Culture Collection. This culture has been assigned accession 
number 39265. 

30 Creation of BGH Chimeric Genes 

[0147] A chimeric gene was created comprising (1 ) a sbss promoter region and 5' non-translated region. (2) a struc- 
tural sequence which codes for bovine growth hormone (BGH) and (3) a NOS 3' non-translated region. This chimeric 
gene was created as follows. 

35 [0148] A structural sequence which codes for the polypeptide, bovine growth hormone, (see. e.g., Woychik et al, 
1 982) was inserted Into a pBR322-derived plasmid. The resulting plasmid was designated as plasmid CF-1 . This plas- 
mid was digested with EcoRI and Hindi! I to yield a 570 bp fragment containing the structural sequence. Double stranded 
RF DNA (described previously and shown in Figure 8) was cleaved with EcoRI and Hindlll to yield a 290 bp 
fragment which contained the NOS 3' non-translated region with a poly-adenylation signal. The two fragments were 

40 ligated together and digested with EcoRI to create an 860 base pair fragment with EcoRI ends, which contained a 
BGH-coding structural sequence joined to the NOS 3' non-translated region. This fragment was introduced into plasmid 
pMON38, which had been digested with EcoRI and treated with alkaline phosphatase, to create a new plasmid, des- 
ignated as pMON 108, as shown in Figure 19. 

[0149] A unique Bglll restruction site was introduced at the 5' end of the BGH structural sequence by digesting pMON 
45 108 with EcoRI to obtain the 860 bp fragment, and using Klenow polymerase to create blunt ends on the resulting 
EcoRI fragment. This fragment was ligated into plasmid N25 (a derivative of pBR327 containing a synthetic linker 
carrying Bglll and Xbal cleavage sites inserted at the BamHI site), which had been cleaved with Xbal and treated with 
Klenow polymerase to obtain blunt ends (N25 contains a unique Bglll site located 12 bases from the Xbal site). The 
resulting plasmid, which contained the 860bp BGH-NOS fragment in the orientation shown in Figure 20, was designated 
50 as plasmid N25-BGH. This plasmid contains a unique Bglll cleavage site located about 25 bases from the 5' end of 
the BGH structural sequence. 

[0150] Plasmid N25-BGH prepared from dam- E. coli cells was digested with Bglll andClal to yield an 860 bp fragment 
which contained the BGH structural sequence joined to the NOS 3' non-translated region. Separately, plasmid 
pMON121 (described previously and shown In Figure 17) was prepared from dam- E. coli cells and was digested with 
55 Clal and Bel! to create an 1100 bp fragment which contained the sbss promoter region. The fragments were ligated at 
their compatible Bcll/Bglll overhangs, and digested with Clal to yield a Clal fragment of about 2 kb containing the 
chimeric sbss-BGH-NOS gene. This fragment was inserted into pMON120 (described previously and shown in Figure 
10) which had been digested with Clal. The resulting plasmids, containing the inserted chimeric gene in opposite 
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orientations were designated pMON147 and pMON148, as shown in Figure 21. 

[0151] An alternate chimeric BGH gene was created which contained (1 ) a NOS promoter region and 5* non-trans- 
lated region. (2) a structural sequence which codes for BGH, and (3) a NOS 3' non-translated region, by the following 
method, shown In Figure 22. 

5 [0152] Plasmid pMON76 (described above and shown in Figure 9) was digested with EcoRI and Bglll to obtain a 
308 bp fragment containing a NOS promoter region and 5' non-translated region. Plasmid N25-BGH prepared from 
dam- E,co!i cells (described above and shown in Figure 20) was digested with Bglll and Clal to obtain a 900 bp fragment 
containing a BGH structural sequence and a NOS 3' non-translated region. These two fragments were ligated together 
to obtain a chimeric NOS-BGH-NOS gene in a fragment with EcoRI and Clal ends. This fragment was ligated with an 

10 8 kb fragment obtained by digesting pMON120 with EcoRI and Clal. The resulting plasmid, designated as pMON149, 
is shown in Figure 22. 

Creation of Chimeric NOS-EPSP-NOS Gene 

IS [0153] A chimeric gene was created comprising (1 ) a NOS promoter region and 5' non-translated region, (2) a struc- 
tural sequence which codes for the E, coli enzyme. 5-enol pyruvyl shikimate-3-phosphoric acid synthase (EPSP syn- 
thase) and (3) a NOS 3' non-translated region. 

[0154] EPSP synthase is believed to be the target enzyme for the herbicide, glyphosate, which is marketed by Mon- 
santo Company under the registered trademark, "Roundup." Glyphosate is known to inhibit EPSP synthase activity 
20 (Amrhein et al, 1980), and amplification of the EPSP synthase gene in bacteria is known to increase their resistance 
to glyphosate. Therefore, increasing the level of EPSP synthase activity in plants may confer resistance to glyphosate 
in transformed plants. Since glyphosate is toxic to most plants, this provides for a useful method of weed control. Seeds 
of a desired crop plant which has been transformed to increase EPSP synthase activity may be planted in a field. 
Glyphosate may be applied to the field at concentrations which will kill all non-transformed plants, leaving the non- 
25 transformed plants unharmed. 

[0155] An EPSP synthase gene may be isolated by a variety of means, including the following. A lambda phage 
library may be created which carries a variety of DNA fragments produced by Hindlll cleavage of JE coli DMA. See, e. 
g., Maniatis et al, 1982. 

[0156] The EPSP synthase gene is one of the genes which are involved in the production of aromatic amino acids. 
30 These genes are designated as the "aro" genes; EPSP synthase is designated as aroA . Cells which do not contain 
functional aro genes are designated as aro- cells. Aro- cells must normally be grown on media supplemented by aro- 
matic amino acids. See Pittard and Wallis, 1 966. 

[0157] Different lambda phages which carry various Hindlll fragments may be used to infect mutant J|. colj cells 
which do not have EPSP synthase genes. The infected aro- cells may be cultured on media which does not contain 

3S the aromatic amino acids, and transformed a ro+ clones which are capable of growing on such media may be selected. 
Such clones are likely to contain the EPSP synthase gene. Phage particles may be isolated from such clones, and 
DNA may be isolated from these phages. The phage DNA may be cleaved with one or more restriction endonucleases, 
and by a gradual process of analysis, a fragment which contains the EPSP synthase gene may be isolated. 
[0158] Using a procedure similar to the method summarized above, the Applicants isolated an 11 kb Hindlll fragment 

40 which contained the entire E. coli EPSP synthase gene. This fragment was digested with Bglll to produce a 3.5 kb 
Hindlll-Bglll fragment which contained the entire EPSP synthase gene. This 3.5 kb fragment was inserted into plasmid 
pkC7 (Rao and Rogers, 1979) to produce plasmid pMON4, whch is shown in Figure 23. 

[0159] Plasmid pMON4 was digested with Clal to yield a 2.5 kb fragment which contained the EPSP synthase struc- 
tural sequence. This fragment was inserted into pBR327 that had been digested with Clal. to create pMONB. as shown 
45 in Figure 23. 

[0160] pMON8 was digested with BamHI and Ndel to obtain a 4.9 kb fragment. This fragment lacked about 200 
nucleotides encoding the amino terminus of the EPSP synthase structural sequence. 

[0161] The missing nucleotides were replaced by ligating a Hinfl/Ndel fragment, obtained from pMONB as shown in 
Figure 24, together with a synthetic oligonucleotide sequence containing (1) the EPSP synthase start codon and the 
so first three nucleotides, (2) a unique Bglll site, and (3) the appropriate BamHI and Hinfl ends. The resulting plasmid, 
pMON25, contains an intact EPSP synthase structural sequence with unique BamHI and Bglll sites positioned near 

the start codon. 

[0162] Double stranded M-2 DNA (described previously and shown in Figure 8) was digested with Hindlll and EcoRI 
to yield a 290 bp fragment which contains the NOS 3' non-translated region and poly-adenylation signal. This fragment 
ss was introduced into a pMON25 plasmid that had been digested with EcoRI and Hindlll to create a plasmid. designated 
as pMON146 (shown in Figure 25) which contains the EPSP structural sequence joined to the NOS 3' non-translated 
region. 

[0163] pMON146 was cleaved with Clal and Bglll to yield a 2.3 kb fragment carrying the EPSP structural sequence 
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joined to the NOS 3' non-translated region. pMON76 (described previously and shown in Figure 9) was digested with 
Bgtll and EcoRI to create a 310 bp fragment containing the NOS promoter region and 5' non-translated region. The 
above fragments were mixed with pMON120 (described previously and shown in Figure 10) that had been digested 
with Clal and EcoRI, and the mixture was ligated. The resulting plasmid, designated pMON153, is shown in Figure 26. 

s This plasmid contains the chimeric NOS-EPSP-NOS gene. 

[01 64] A plasmid containing a chimeric sbss-E PSP-NOS gene was prepared in the following manner, shown in Figure 
27. Plasmid pMON146 (described previously and shown in Figure 25) was digested with Clal and Bglll. and a 2.3 kb 
fragment was purified. This fragment contained the EPSP synthase structural sequence coupled to a NOS 3' non- 
translated region with a poly-adenylation signal. Plasmid pMON121 (described above and shown in Figure 17) was 

10 digested with Clal and Bell, and a 1.1 kb fragment was purified. This fragment contains an sbss promoter region and 
5' non -translated region. The two fragments were mixed and ligated with T4 DNA ligase and subsequently digested 
with Clal. This created a chimeric sbss-EPSP-NOS gene, joined through compatible Bglll and Bell termini. This chimeric 
gene with Clal termini was inserted Into plasmid pMON120 which had been digested with Clal and treated with calf 
alkaline phosphatase (CAP). The mixture was ligated with T4 DNA ligase. The resulting mixture of fragments and 

16 plasmids was used to transform^ coli cells, which were selected for resistance to spectinomycin. A colony of resistant 
cells was isolated, and the plasmid in this colony was designated as pMON154, as shown in Figure 27. 
[0165] A culture of E. coll containing pMON154 has been deposited with the American Type Culture Center, This 
culture has been assigned accession number 39265. 

20 Creation of Chimeric CaMV(32S)-NPT ll-NOS Genes 

[0166] In an alternate preferred embodiment of this invention, a chimeric gene was created comprising 

(1) a promoter region whk;h causes transcription of the 32S CaMV mRNA; 
25 (2) a structural sequence which codes for NPT II; and 

(3) a NOS 3' non-translated region. 

[0167] The assembly of this chimeric gene is described in Example 11 and Figures 33 through 37. This gene was 
inserted into plant cells and it caused them to become resistant to kanamycin. 
30 [0168] Petunia plants cannot normally be Infected by CaMV. Those skilled in the art may determine through routine 
experimentation whether any particular plant viral promoter (such as the CaMV promoter) will function at satisfactory 
levels in any particular type of plant cell, including plant cells that are outside of the normal host range of the virus from 
which the promoter was derived. 

Means for Inserting Chimeric Genes Into Plant Cells 

[0169] A variety of methods are known for inserting foreign DNA into plant cells. One such method, utilized by the 
Applicants, involved inserting a chimeric gene into Ti plasmids carried by A. tumefaciens. and co-cultivating the A. 
tumefaciens cells with plants. A segment of T-DNA carrying the chimeric gene was transferred into the plant genome. 
40 causing transformation. This method is described in detail in two separate U.S. patent applications entitled "Plasmids 
for Transforming Plant Cells," serial number 458,411, (WO84/02919) and "Genetically Transformed Plants," serial 
number 458.402, (WO84/02920) both of which were filed on January 17, 1983. 

[0170] A variety of other methods are listed below. These methods are theoretically capable of inserting the chimeric 
genes of this invention into plant cells, although the reported transformation efficiencies achieved to date by such 
45 methods have been low. The chimeric genes of this invention (especially those chimeric genes such as NPT I and NPT 
II, which may be utilized as selectable markers) are likely to facilitate research on methods of inserting DNA into plants 
or plant cells. 

[0171] 1. One alternate technique for inserting DNA into plant cells involves the use of lipid vesicles, also called 
liposomes. Liposomes may be utilized to encapsulate one or more DNA molecules. The liposomes and their DNA 
50 contents may be taken up by plant cells; see, e.g., Lurquin, 1981. If the inserted DNA can be incorporated into the 
plant genome, replicated, and inherited, the plant cells will be transformed. 

[0172] To date, efforts to use liposomes to deliver DNA into plant cells have not met with great success (Fraley and 
Papahadjopoulos, 1981). Only relatively small DNA molecules have been transferred into plant cells by means of 
liposomes, and none have yet been expressed. However, liposome-delivery technology is still being actively developed, 
55 and it is likely that methods will be developed for transferring plasmkJs containing the chimeric genes of this invention 

into plant cells by means involving liposomes. 

[01 73] 2. Other alternate techniques involve contacting plant cells with DNA which Is complexed with either (a) poly- 
cationic substances, such as poly-L-omithine (Davey et al. 1 980), or (b) calcium phosphate (Krens et al, 1 982). Although 
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efficiencies of transformation achieved to date have been low, these methods are still being actively researched. 
[0174] 3. A method has been developed involving the fusion of bacteria, which contain desired plasmids, with plant 
cells. Such methods involve converting the bacteria Into spheroplasts and converting the plant cells into protoplasts. 
Both of these methods remove the cell wall barrier from the bacterial and plant cells, using enzymic digestion. The two 

s cell types can then be fused together by exposure to chemical agents, such as polyethylene glycol. See Hasezawa et 
al, 1981. Although the transformation efficiencies achieved to date by this method have been low, similar experiments 
using fusions of bacterial and animal cells have produced good results; see Rassoulzadegan et al, 1982. 
[0175] 4. Two other methods which have been used successfully to genetically transform animal cells involve (a) 
direct microinjection of DN A into animal cells, using very small glass needles (Capecchi. 1 980), and (b) electric-current- 

10 induced uptake of DNA by animal cells (Wong and Neumann, 1 982). Although neither of these techniques have been 
utilized to date to transform plant cells, they may be useful to insert chimeric genes of this invention into plant cells. 

Meaning of Various Phrases 

IS [0176] A variety of phrases which are used in the claims must be defined and described to clarify the meaning and 
coverage of the claims. 

[0177] The meaning of any particular term shall be Interpreted with reference to the text and figures of this application. 
In particular, it is recognized that a variety of terms have developed which are used Inconsistently in the literature. For 
example, a variety of meanings have evolved for the term "promoter." some of which include the 5' non-translated 
20 region and some of which do not. In an effort to avoid problems of Interpretation, the Applicants have attempted to 
define various terms. However, such definitions are not presumed or intended to be comprehensive and they shall be 
interpreted in light of the relevant literature. 

[0178] The term "chimeric gene" refers to a gene that contains at least two portions that were derived from different 
and distinct genes. As used herein, this term is limited to genes which have been assembled, synthesized, or otherwise 
25 produced as a result of man-made efforts, and any genes which are replicated or otherwise derived therefrom, "i^an- 
made efforts" include enzymatic, cellular, and other biological processes, if such processes occur under conditions 
which are caused, enhanced, or controlled by human effort or intervention; this excludes genes which are created 
solely by natural processes. 

[0179] As used herein, a "gene" is limited to a segment of DNA which is normally regarded as a gene by those skilled 
30 in the art. For example, a plasmid might contain a plant-derived promoter region and a heterologous structural se- 
quence, but unless those two segments are positioned with respect to each other in the plasmid such that the promoter 
region causes the transcription of the structural sequence, then those two segments would not be regarded as included 
in the same gene. 

[0180] This invention relates to chimeric genes which have structural sequences that are "heterologous" with respect 
35 to their promoter regions. This includes at least two types of chimeric genes: 

1 . DNA of a gene which is foreign to a plant cell. For example, if a structural sequence which codes for mammalian 
protein or bacterial protein is coupled to a plant promoter region, such a gene would be regarded as heterologous. 

2. A plant cell gene which is naturally promoted by a different plant promoter region. For example, if a structural 
40 sequence which codes for a plant protein is normally controlled by a tow-quantity promoter, the structural sequence 

may be coupled with a prolific promoter This might cause a higher quantity of transcription of the structural se- 
quence, thereby leading to plants with higher protein content. Such a structural sequence would be regarded as 
heterologous with regard to the prolific promoter. 

45 [0181] However, it is not essential for this invention that the entire structural sequence be heterologous with respect 
to the entire promoter region. For example, a chimeric gene of this invention may be created which would be translated 
into a "fusion protein", i.e., a protein comprising polypeptide portions derived from two separate structural sequences. 
This may be accomplished by inserting all or part of a heterologous structural sequence into the structural sequence 
of a plant gene, somewhere after the start codon of the plant structural sequence. 

so [0182] As used herein, the phrase, "a promoter region derived from a specified gene" shall include a promoter region 
if one or more parts of the promoter region were derived from the specified gene. For example, it might be discovered 
that one or more portions of a particular plant-derived promoter region (such as intervening region 8, shown on Figure 
1) might be replaced by one or more sequences derived from a different gene, such as the gene that contains the 
heterologous structural sequence, without reducing the expression of the resulting chimeric gene in a particular type 

55 of host cell. Such a chimeric gene would contain a plant-derived association region 2, intervening region 4, and tran- 
scription Initiation sequence 6, followed by heterologous intervening regkjn 8, 5' non-translated region 10 and structural 
sequence 14. Such a chimeric gene is within the scope of this Invention. 

[0183] As used herein, the phrase "derived from" shall be construed broadly. For example, a structural sequence 



19 



EP 0131 623 B2 

may be "derived from" a particular gene by a variety of processes, including the following: 

1. the gene may be reproduced by various means such as inserting it Into a plasmid and replicating the plasmid 
by cell culturing, in vitro replication, or other methods, and the desired sequence may be obtained from the DNA 

5 copies by various means such as endonuclease digestion; 

2. mRNA which was coded for by the gene may be obtained and processed in various ways, such as preparing 
complementary DNA from the mRNA and then digesting the cDNA with endonucleases; 

3. the sequence of bases in the structural sequence may be determined by various methods, such as endonuclease 
mapping or the Maxam-Gilbert method. A strand of DNA which duplicates or approximates the desired sequence 

10 may be created by various methods, such as chemical synthesis or ligation of oligonucleotide fragments. 

4. a structural sequence of bases may be deduced by applying the genetic code to the sequence of amino acid 
residues in a polypeptide. Usually, a variety of DNA structural sequences may be determined for any polypeptide, 
because of the redundancy of the genetic code. From this variety, a desired sequence of bases may be selected, 
and a strand of DNA having the selected sequence may be created. 

1$ 

[0184] If desired, any DNA sequence may be modified by substituting certain bases for the existing bases. Such 
modifications may be performed for a variety of reasons. For example, one or more bases in a sequence may be 
replaced by other bases in order to create or delete a cleavage site for a particular endonuclease. As another example, 
one or more bases in a sequence may be replaced in order to reduce the occurrence of "stem and loop" structures in 
20 messenger RNA. Such modified sequences are within the scope of this invention. 

[0185] A structural sequence may contain Introns and exons; such a structural sequence may be derived from DNA, 
or from an mRNA primary transcript. Alternately, a structural sequence may be derived from processed mRNA, from 
which one or more Introns have been deleted. 

[0186] The Applicants have deposited two cultures of E. coll cells containing plasmids pMON128 and pMON154 with 
25 the American Type Culture Collection (ATCC). These cells have been assigned ATCC accession numbers 39264 and 
39265, respectively. 

[0187] Those skilled In the art will recognize, or be able to ascertain using no more than routine experimentation, 
numerous equivalents to the specific embodiments described herein. Such equivalents are within the scope of this 
invention. 

30 

EXAMPLES 

Example 1 Creation of dI\/ION1001 

35 [0188] Fifty micrograms (ug) of lambda phage bbkan-1 DNA (Berg et al, 1 975) were digested with 1 00 units of Hindlll 
(all restriction endonucleases were obtained from New England Biolabs, Beverly, MA, and were used with buffers 
according to the suppliers instructions, unless otherwise specified) tor 2 hr at 37° C. After heat-inactivation (70°C, 10 
min), the 3.3 Kb Tn5 Hindlll fragment was purified on a sucrose gradient. One ug of the purified Hindlll fragment was 
digested with BamHI (2 units. 1 hr, 37** C), to create a 1.8 kb fragment. The endonuclease was heat inactivated. 

40 [0189] Plasmid pBR327 (Soberon et al, 1981). 1 ug, was digested with Hindlll and BamHI (2 units each, 2 hr. 37** 
C). Following digestion, the endonucleases were heat inactivated and the cleaved pBR327 DNA was added to the 
BamHI-Hindlll TnS fragments. After addition of ATP to a concentration of 0.75mM. 1 0 units of T4 DNA ligase (prepared 
by the method of Murray et al. 1979) was added, and the reaction was allowed to continue for 16 hours at 12-14° C. 
One unit of T4 DNA ligase will give 90% circularization of one ug of Hindi 1 1 -cleaved pBR327 plasmid in 5 minutes at 

45 22" C. 

[0190] The ligated DNA was used to transform CaClg-shocked E coli C600 rec A56 cells (Maniatis et al, 1982). After 
expression in Luria broth (LB) for 1 hour at 37° C the cells were spread on solid LB media plates containing 200 ug/ 
ml ampicillin and 40 ug/ml kanamycin. Following 16 hour incubation at 37° C, several hundred colonies appeared. 
Plasmid mini-prep DNA was prepared from six of these. (Ish-Horowicz and Burke, 1981). Endonuclease digestion 
so showed that all six of the plasmids carried the 1 .8 kb Hindlll-BamHI fragment. One of those isolates was designated 
as pMONIOOl as shown in Figure 6. 

Example 2: Creation of pMON40 

5$ [01 91] Five ug of plasmid pMON 1 001 (described In Example 1 ) was digested with Smal. The reaction was terminated 
by phenol extraction, and the DNA was precipitated by ethanol. A BamHI linker CCGGATCCGG (0.1 ug), which had 
been phosphorylated with ATP and T4 polynucleotide kinase (Bethesda Research Laboratory, Rockville, MD) was 
added to 1 ug of the pMONI 001 fragment. The mixture was treated with T4 DNA ligase (1 00 units) for 1 8 hours at 1 4" 
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C. After heating at 70** C for 10 min to inactivate the DNA ligase, the DNA mixture was digested with BamHI endonu- 
clease (20 units, 3 hours, 37"* C) and separated by electrophoresis on an 0.5% agarose gel. The band corresponding 
to the 4.2 kb Smal-BamHI vector fragment was excised from the gel. The 4.2 kb fragment was purified by adsorption 
on glass beads (Vogelstein and Gillespie, 1979), ethanol precipitated and resuspended In 20 ul of DNA llgase buffer 

s with ATP. T4 DNA ligase (20 units) was added and the mixture was incubated for 1 .5 hours at room temperature. The 
DNA was mixed with rubidium chloride-shocked E. coli C600 cells for DNA transformation (Maniatis et al, 1982). After 
expression for 1 hour at 37° C in LB, the cells were spread on LB plates containing 200 ug/ml of ampiclllin and 20 ug/ 
ml kanamycin. The plates were Incubated at 37° C for 16 hours. Twelve amplcillin-resistant, kanamycin-resistant col- 
onies were chosen, 2 ml cultures were grown, and mini-plasmid preparations were performed. Endonuclease mapping 

10 of the plasmids revealed that ten of the twelve contained no Smal site and a single BamHI site, and were of the ap- 
propriate size. 4.2 kb. The plasmid from one of the ten colonies was designated as pMON40. as shown in Figure 6. 

Example 3: Creation of NOS Promoter Fragment 

15 [0192] An oligonucleotide with the following sequence, 5'-TGCAGATTATTTGG-3', was synthesized (Beaucage and 
Carruthers. 1981, as modified by Adams et al. 1982). This oligonucleotide contained a^P radioactive label, which 
was added to the 5' thymidine residue by polynucleotide kinase. 

[0193] An M13 mp7 derivative, designated as SI A. was given to Applicants by M. Bevan and M.-D. Chilton, Wash- 
ington University. St. Louis, MO. To the best of Applicants' knowledge and belief, the SIA DNA was obtained by the 

20 following method. A pTiT37 plasmid was digested with Hindlll, and a 3.4 kb fragment was Isolated and designated as 
the Hindlll-23 fragment. This fragment was digested with Sau3a. to create a 344 bp fragment with Sau3a ends. This 
fragment was inserted Into double-stranded, replicative fomn DNA from the M13 mp7 phage vector (Messing et al, 
1981) which had been cut with BamHI. Two recombinant phages with 344 bp inserts resulted, one of which contained 
the anti-sense strand of the NOS promoter fragment. That recombinant phage was designated as SIA, and a clonal 

25 copy was given to the Applicants. 

[0194] The Applicants prepared the single-stranded form of the SIA DNA (14.4 ug; 6 pmol), and annealed it (10 
minutes at 70° C. then cooled to room temperature) with 20 pmol of the 14-mer oligonucleotide, mentioned above. 
The oligonucleotide annealed to the Sau3a insert at bases 286-300 as shown on Figures 4 and 5. 
[0195] 200 ul of the SIA template and annealed oligonucleotide were mixed with the four dNTP's (present at a final 

30 concentration of ImM. 25 ul) and 50 ul of Klenow polymerase. The mixture incubated for 30 minutes at room temper- 
ature. During this period, the polymerase added dNTP's to the 3' end of the oligonucleotide. The polymerase was heat- 
inactivated (70°C. 3 min), and Haelll (160 units) were added. The mixture was incubated (1 hour, 55° C), the Haelll 
was inactivated (70°C, 3 min), and the four dNTP's (ImM. 12 ul) and T4 DNA polymerase (50 units) were added. The 
mixture was incubated (1 hour, 37° C) and the polymerase was inactivated (70°C, 3 min). This yielded a fragment of 

35 about 570 bp. EcoRI (150 units) was added, the mixture was incubated (1 hour. 37° C) and the EcoRI was inactivated 
(70 C, 3 min). 

[0196] Aliquots of the mixture were separated on 6% polyacrylamide with 25% glycerol. Autoradiography revealed 
a radioactively labelled band about 310 bp In size. This band was excised. The foregoing procedure is indicated by 
Figure 5. 

40 

Example 4: Creation of pMON58 

[0197] Five ug of plasmid pMON40 (described in Example 2) were digested with Bglll (10 units, 1.5 hour, 37° C), 
and the Bglll was inactivated (70 ° C, 10 min). The four dNTP's (ImM, 5 ul) and Klenow polymerase (8 units) were 

45 added, the mixture was incubated (37°C, 40 min), and the polymerase was inactivated (70 ° C. 10 min). EcoRI (10 
units) was added and incubated (1 hour. 37° C), and calf alkaline phosphatase (CAP) was added and incubated (1 
hour, 37° C). A fragment of about 3.9 kb was purified on agarose gel using NA-45 membrane (Scheicher and Scheull, 
Keene NH). The fragment (1 .0 pM) was mixed with the NOS promoter fragment (0.1 pM), described in Example 3, and 
with T4 DNA ligase (100 units). The mixture was incubated (4° C. 16 hr). The resulting plasmids were inserted into E. 

50 coli cells, which were selected on media containing 200 ug/ml amplcillin. Thirty-six clonal Amp^^ colonies were selected, 
and mini-preps of plasmids were made from those colonies. The plasmid from one colony demonstrated a 308 bp 
EcoRI-Bglll fragment, a new Sstll cleavage site carried by the 308 bp NOS fragment, and a new PstI site. This plasmid 
was designated as pMON58, as shown in Figure 7. pMON58 DNA was prepared as described above. 

55 Example 5: Creation of PMON42 

[0198] Plasmid pBR325-Hindlll-23. a derivative of plasmid pBR325 (Bolivar, 1 978) carrying the Hindlll-23 fragment 
of pTIT37 (see Figure 3) in the Hindlll site, was given to Applicants by M. Bevan and M.-D. Chilton, Washington Uni- 
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versity, St. Louis. MO. DNA of this plasmid was prepared and 30 ug were digested with Hindlll (50 units) and BamHI 
(50 units). The 1.1 kb Hindlll-BannHI fragment was purified by adsorption on glass beads (Vogelstein and Gillespie, 
1 979) after agarose gel electrophoresis. The purified fragment (0.5 ug) was added to 0.5 ug of the 2.9 kb Hindlll-BamHI 
fragment of pBR327. After treatment with DNA ligase (20 units. 4 hours, 22*C), the resulting plasmids were introduced 
5 to E. coli C600 cells. Clones resistant to ampicillin at 200 ug/ml were selected on solid media; 220 clones were obtained. 
Minipreps of plasmid DNA were made from six of these clones and tested with the presence of a 1 . 1 kb fragment after 
digestion with Hindlll and BamHI . One plasmid which demonstrated the correct insert was designated pMON42. Plasmid 
pMON42 DNA was prepared as described in previous examples. 

10 Example 6: Creation of l\413 Clone M-2 

[0199] Seventy-five ug of plasmid pMON42 (described in Example 5) prepared from dam- EcoH cells were digested 
with Rsal and BamHI (50 units of each. 3 hours, 37» C) and the 720 bp Rsal-BamHI fragment was purified using NA- 
45 membrane. Eight ug of the purified 720 bp BamHI-Rsal fragment were digested with Mbol (10 min, 70" C), the ends 

IS were made blunt by filling in with the large Klenow fragment of DNA polymerase I and the four dNTPs. Then 0.1 ug 
of the resulting DNA mixture was added to 0.05 ug of Ml 3 mp8 previously digested with Smal (1 unit, 1 hour 37° C) 
and calf alkaline phosphatase (0.2 units). After ligation (10 units of T4 DNA ligase, 16 hours, 12^* C) and transfection 
of E. coN JM101 cells, several hundred recombinant phage were obtained. Duplex RF DNA was prepared from twelve 
recombinant, ph age-carrying clones. The RE DNA (0. 1 ug) was cleaved with EcoRt, (1 unit, 1 hour, 37** C). end-labeled 

20 with 32p<jATP and Klenow polymerase, and re-digested with BamHI (I unit. 1 hour. 37' C). The Ecoi=^l and BamHI 
sites span the Smal site. Therefore, clones containing the 260 bp Mbol fragment could be identified as yielding a 
labelled 270 bp fragment after electrophoresis on 6% polyacrylamide gels and autoradiography Four of the twelve 
clones carried this fragment. The orientation of the Insert was detemained by digestion of the EcoRI-cleaved, end- 
labeled RF DNA (0.1 ug) with Hinfl (1 unit, 1 hour, 37" C), Hinfl cleaves the 260 bp Mbol fragment once 99 bp from the 

25 3' end of the fragment and again 42 bp from the end nearest the NOS coding region. Two clones of each orientation 
were obtained. One clone, digested as M-2 as shown In Figure 8, contained the 260 bp fragment with the EcoRI site 
at the 3' end of the fragment. M-2 RF DNA was prepared using the procedures of Messing, et al 1 981 . 

Example 7: Creation of pMON75 and pMON76 

30 

[0200] Fifty ug of M-2 RF DNA (described In Example 6) were digested with 50 units of EcoRI and 50 units of BamHI 
for 2 hours at 37°C. The 270 bp fragment (1 ug) was purified using agarose gel and N A-45 membrane. Plasmid pMON58 
(described in Example 4) was digested with EcoRI and BamHI (50 ug, 50 units each, 2 hours, 37° C) and the 1 300 bp 
fragment was purified using NA-45 membrane. The 270 bp EcoRI-BamHI (0.1 ug) and 1300 bp EcoRI-BamHI (0.5 ug) 
35 fragments were mixed, treated with T4 DNA ligase (2 units) for 1 2 hours at 1 4** C. After heating at 70" C for 1 0 minutes 
to inactivate the ligase, the mixture was treated with EcoRI (10 units) for 1 hour at 3/* C, then heated to 70°C for 10 
minutes to inactivate the EcoRI. This completed the assembly of a chimeric NOS-NPT ll-NOS gene on a 1.6 kb frag- 
ment, as shown on Figure 9. 

[0201] Plasmid pMON38 is a clone of the pTiT37 Hindlll-23 fragment inserted in the Hindlll site of pBR327 (Soberon 
40 et al, 1980). pMON38 DNA (20 ug) was digested with EcoRI (20 units. 2 hours, 37' C) and calf alkaline phosphatase 
(0.2 units, 1 hour, 37" C). The pMON38 DNA reaction was extracted with phenol, precipitated with ethanol. dried and 
resuspended in 20 ul of 10 mM Tris-HCI, 1 mM EDTA, pH 8. 

[0202] 0.2 ug of the cleaved pMON38 DNA was added to the chimeric gene mixture described above. The mixture 
was treated with T4 DNA ligase (4 units, 1 hour, 22 C) and mixed with Rb chloride-treated E. coli C600 rec A56 cells 

45 to obtain transformation. After plating with selection for ampicillin-resistant (200 ug/ml) colonies, 63 potential candidates 
were obtained. Alkaline mini-preps of plasmid DNA were made from 12 of these and screened by restriction endonu- 
clease digestion for the proper constructs. Plasmid DNA's that contained a 1 .5 kb EcoRI fragment and a new Bglll site 
were digested with BamHI to determine the orientation of the 1.5 kb EcoRI fragment. One of each insert orientation 
was picked. One plasmid was designated pMON75 and the other pMON76. as shown in Figure 9. DNA from these 

50 plasmids were prepared as described in previous examples. 

Example 8. Creation of plasmids PMON126 and pMON129 

[0203] The 1.5 kb EcoRI fragment was excised by EcoRI digestion from either pMON75 or pMON76 and purified 
55 after agarose gel electrophoresis as described in previous examples. Five ug of DNA from plasmid pMON120 (de- 
scribed in a separate application. "Plasmids for Transforming Plant Cells." (WO84/02919) cited previously) was digest- 
ed with EcoRI and treated with calf alkaline phosphatase. After phenol deproteinization and ethanol precipitation, the 
EcoRI-cleaved pMONI 20 linear DNA was mixed with 0.5 ug of the 1 .5 kb EcoRI chimeric gene fragment. The mixture 
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was treated with 2 units of T4 DNA ligasefor 1 hour at 22oC. After transformation of E, coH cells (Maniatis et.al. 1982) 
and selection of colonies resistant to spectinomycin (50 ug/nnl), several thousand colonies appeared. Six of these were 
picked, grown, and plasmid mini-preps made. The plasmid DNA's were digested with EcoRI to check for the 1.5 kb 
chimeric gene insert and with BamHI to determine the orientatbn of the insert. BamHI digestion showed that in 
5 pMON128 the chimeric gene was transcribed in the same direction as the Intact nopaline synthase gene of pMON1 20. 
The orientation of the insert in pMONI 29 was opposite that in pMONI 28; the appearance of an additional 1 .5 kb BamHI 
fragment in digests of pMON129 showed that plasmid pMON129 carried a tandem duplication of the chimeric NOS- 
NPT ll-NOS gene, as shown in Figure 10. 

10 Example 9: Creation of Plasmid pMON156 

[0204] Plasmids which contained CaMV DNA were a gift to Monsanto Company from Dr. R. J. Shepherd, University 
of California, Davis. To the best of Applicants' knowledge and belief, these plasmids (designated as pOSI) were obtained 
by inserting the entire genome of a CaMV strain designated as CM4-184 (Howarlh et 1 , 1 981 ) into the Sal I restriction 
15 site of a pBR322 plasmid (Bolivar et al, 1977). E. coli cells transformed with pOSI were resistant to amplcillin (Amp^^) 
and sensitive to tetracycline (Tet^). 

[0205] Various strains of CaMV suitable for isolation of CaMV DNA which can be used in this invention are publicly 
available; see. e.g., ATCC Catalogue of Strains II, p. 387 (3rd edition. 1981). 

[0206] pOSI DNA was cleaved with Hindlll. Three small fragments were purified after electrophoresis on an 0.8% 
20 agarose gel using NA-45 membrane (Schleicher and Schuell, Keene NH). The smallest fragment, about 500 bp in size, 
contains the 198 promoter. This fragment was further purified on a 6% acrylamide gel. After various manipulations 
which did not change the sequence of this fragment (shown in Figure 28), it was digested with Mbol to create a 455 
bp Hindlll-Mbol fragment. This fragment was mixed with a 1 250 bp fragment obtained by digesting pMON75 (described 
in Example 7 and shown in Figure 9) with Bglll and EcoRI. This fragment contains the NPTII structural sequence and 
25 the NOS 3' non-translated region. The two fragments were ligated together by their compatible Mbol and Bglll over- 
hangs to create a fragment containing the CaMV(19S)-NPTIl-NOS chimeric gene. This fragment was inserted into 
pMON120 which had been cleaved with Hindlll and EcoRI. The resulting plasmid was designated as pMON156, as 
shown in Figure 29. 

[0207] Plasmid pMON156 was inserted into E. coli cells and subsequently Into A. tumefaciens cells where it formed 
30 a co-integrate Tl plasmid having the CaMV(19S)-NPT ll-NOS chimeric gene surrounded by T-DNA borders. A. tume- 
faciens cells containing the co-integrate plasmids were co-cultivated with petunia cells. The co-cultivated petunia cells 
were cultured on media containing kanamycin. Some of the co-cultivated petunia cells sun/ived and produced colonies 
on media containing up to 50 ug/ml kanamycin. This indicated that the CaMV(1 9S)-NPT ll-NOS genes were expressed 
in petunia cells. These results were confirmed by Southern blot analysis of transformed plant cell DNA. 

35 

Example 10: Creation of pMONISS 

[0208] Plasmid pMON72 was obtained by inserting a 1.8 kb Hindlll-BamHI fragment from bacterial transposon Tn5 
(which contains an NPTII structural sequence) into a Pstl- pBR327 plasmid digested with Hindlll and BamHL This 
40 plasmid was digested with Bglll and Pstl to remove the NPTII structural sequence. 

[0209] Plasmid pMONIOOl (described in Example 1 and shown in Figure 6) from dam- cells was digested with Bglll 
and Pstl to obtain a 218 bp fragment with a partial NPTII structural sequence. This fragment was digested with Mbol 
to obtain a 1 94 bp fragment. 

[0210] A triple ligation was performed using (a) the large Pstl-Bglll fragment of pMON72; (b) the Pstl-Mbol fragment 
45 from pMONIOOl; and (c) a synthetic linker with Bglll and Mbol ends having stop codons in all three reading frames. 
After transformation of E. coli cells and selection for ampicillin resistant colonies, plasmid DNA from Amp^^ colonies 
was analyzed. A colony containing a plasmid with the desired structure was identified. This plasmid was designated 
pMONIIO, as shown on Figure 30. 

[0211] In order to add the 3' end of the NPT II structural sequence to the 5' portion in pMONIIO, pMONllO was 
50 treated with XhoL The resulting overhanging end was filled in to create a blunt end by treatment with Klenow polymerase 
and the four deoxy-nucleotide triphosphates (dNTP's), A, T C, and G. The Klenow polymerase was inactivated by 
heat, the fragment was digested with Pstl, and a 3.6 kb fragment was purified. Plasmid pMON76 (described in Example 
7 and shown in Figure 9) was digested with Hindlll, filled in to create a blunt end with Klenow polymerase and the four 
dNTP's, and digested with Pstl, An 1100 bp fragment was purified, which contained part of the NPT II structural se- 
ss quence, and a nopaline synthase (NOS) 3' non-translated region. This fragment was ligated with the 3.6 kb fragment 
from pMONIIO. The mixture was used to transform E. coli cells; Amp^^ cells were selected, and a colony having a 
plasmid with the desired structure was identified. This plasmid was designated pMON132, as shown on Figure 31. 
Plasmid pMON93 (shown on Figure 28) was digested with Hindlll, and a 476 bp fragment was isolated. This fragment 
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was digested with Mbol, and a 455 bp fragment was purified which contained the CaMV (19S) promoter region and 5' 
non-translated region. Plasmid pMONI 32 was digested with EcoRl and Bglll to obtain a 1 250 bp fragment with (1 ) the 
synthetic tinker equipped with stop codons in all three reading frames; (2) the NPT It structural sequence; and (3) the 
NOS 3' non-translated region. These two fragments were joined together through the compatible Mbol and Bglll ends 
s to create a CaMV (19S)-NPT ll-NOS chimeric gene. 

[0212] This gene was Inserted Into pMON120, which was digested with Hindlll and EcoRI, to create plasmid 
pMON155, as shown in Figure 32. 

[0213] Plasmid pMON155 was inserted into A. tumefaciens GV3111 celts containing a Ti plasmid, pTlB653. The 
pMONI 55 plasmid formed a cointegrate plasmid with the Ti plasmid by means of a single crossover event. Cells which 
10 contain this co-integrate plasmid have been deposited with the American Type Culture Collection, and have been 
assigned ATCC accession number 39336. A fragment which contains the chimeric gene of this invention can be ob- 
tained by digesting the co-integrate plasmid with Hindlll and EcoRI, and purifying the 1 .7 kb fragment. These cells have 
been used to transform petunia cells, allowing the petunia cells to grow on media containing at least 100 ug^t kan- 
amycin. 

75 

Example 11: Creation of PMONI 83 and 184 

[0214] Plasmid pOSI (described in Example 9) was digested with Bglll, and 1200 bp fragment was purified. This 
fragment contained the 32S promoter region and part of the 5' non-translated region. It was inserted into plasmid 

20 pSHL72 which had been digested with BamHI and Bglll (pSHL72 Is functionally equivalent to pAG060, described in 
Colbere-Garapin et al, 1981). The resulting plasmid was designated as pMONSO, as shown on Figure 33. 
[0215] The cloned Bglll fragment contains a region of DNA that acts as a polyadenylatlon site for the 32S RNA 
transcript. This polyadenylatlon region was removed as follows: pMON50 was digested with Avail and an 1100 bp 
fragment was purified. This fragment was digested with EcoRI* and EcoRV. The resulting 190 bp EcoRI-EcoRV frag- 

2S ment was purified and inserted into plasmid pBR327, which had been digested with EcoRI and EcoRV, The resulting 
plasmid, pMONBI . contains the CaMV 32S promoter on a 190 bp EcoRV-EcoRI fragment, as shown on Figure 33. 
[0216] To make certain the entire promoter region of CaMV(32S) was present in pMON81 , a region adjacent to the 
5' (EcoRV) end of the fragment was inserted into pMONBI in the following way Plasmid pMON50 prepared from dam- 
cells was digested with EcoRI and Bglll and the resultant 1550 bp fragment was purified and digested with Mbol. The 

30 resulting 725 bp Mbol fragment was purified and inserted into the unique Bglll site of plasmid pKC7 (Rao and Rogers, 
1979) to give plasmid pMON125, as shown in Figure 34. The sequence of bases adjacent to the two Mbol ends re- 
generates Bglll sites and allows the 725 bp fragment to be excised with Bglll. 

[0217] To generate a fragment carrying the 32S promoter, the 725 bp Bglll fragment was purified from pMON125 
and was subsequently digested with EcoRV and Alul to yield a 1 90 bp fragment. Plasmid pMON8l was digested with 
35 BamHI, treated with Klenow polymerase and digested with EcoRV. The 3.1 kb EcoRV-BamHI(blunt) fragment was 
purified, mixed with the 190 bp EcoRV-Alul fragment and treated with DNA ligase. Following transformation and se- 
lection of ampicillln-resistant cells, plasmid pMONI 72 was obtained which carries the CaMV(32S) promoter sequence 
on a 380 bp BamHI-EcoRI fragment, as shown on Figure 35. This fragment does not carry the poly-adenylation region 
for the 32S RNA. Ligation of the Alul end to the filled-in BamHI site regenerates the BamHI site. 
40 [0218] To rearrange the restriction endonuclease sites adjacent to the CaMV(32S) promoter, the 380 bp BamHI - 
EcoRI fragment was purified from pMON172. treated with Klenow polymerase, and inserted into the unique Smal site 
of phage M13 mp8. One recombinant phage, M12. carried the 380 bp fragment in the orientation shown on Figure 36. 
The replicative form DNA from this phage carries the 32S promoter fragment on an EcoRI(5')-BamHI(3') fragment. 
[0219] Plasmlds carrying a chimeric gene (CaMV(32S) promoter region-NPT It structural sequence-NOS 3' non- 
45 translated region) were assembled as follows. The 380 bp EcoRI-BamHI CaMV (32S) promoter fragment was purified 
from phage M12 RF DNA and mixed with the 1250 bp Bglll-EcoRI NPT ll-NOS fragment from pMON75. Joining of 
these two fragments through their compatible BamHI and Bglll ends results in a 1 .6 kb CaMV(32S)-NPT ll-NOS chi- 
meric gene. This gene was Inserted Into pMON120 at the EcoRI site in both orientations. The resultant plasmlds. 
pMON183 and 184, appear in Figure 37. These plasmids were used to transform petunia cells. The transformed cells 
so are capable of growth on media containing 100 ug/ml kanamycin. 
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1. A chimeric gene capable of expressing a neomycin phosphotransferase polypeptide in plant cells conferring an- 
tibiotic resistance to the plant when inserted into the plant genome, comprising in sequence: 
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(a) a promoter region from a ribulose-1,5-bis-phosphate carboxylase small subunit gene; 

(b) a 5' non-translated region; 

(c) a structural coding sequence encoding neomycin phosphotransferase I or II; and 

(d) a 3' non-translated region of a gene naturally expressed in plant cells, said region encoding a signal se- 
quence lor polyadenylation of mRNA; said promoter being heterologous with respect to the structural coding 
sequence. 

2. A gene of Claim 1 in which the 3' non-translated region is selected from a gene from the group consisting of the 
genes from the T-DNA region of Agrobacterium tumefaciens. 

3. A gene of Claim 1 in which the 3" non-translated region Is from the nopallne synthase gene of Agrobacterium 
tumefaciens. 

4. A chimeric gene capable of expressing a polypeptide In plant cells comprising in sequence: 



(a) a full-length transcript promoter region isolated from cauliflower mosaic virus; 

(b) a 5' non-translated region; 

(c) a structural coding sequence; 

(d) a 3' non-translated region of a gene naturally expressed in plants, said region encoding a signal sequence 
20 for polyadenylation of mRNA, said structural coding sequence being heterologous with respect to said pro- 
moter region. 

5. A gene of Claim 4 in which the 3' non-translated region Is from a nopaline synthase gene. 
25 8. A culture of microorganisms identified by ATCC accession number 39265. 

Patentanspruche 

30 1, Chimares Gen, welches ein Neomycin-Phototransferase-Polypeptid in Pflanzenzellen exprimieren kann, das der 
Pflanze Antibiotikum-Resistenz verleiht, wenn es in das RIanzengenom insertlert wlrd, umfassend in Sequenz: 

(a) eine Pronriotorregion von einem kleinen Ribulose-1,5-bis-phosphatcarboxylase-Untereinheit-Gen; 

(b) eine nicht-translatierte 5'-Region; ^ 
35 (c) eine Strukturcodiersequenz, die fur Neomycin-Phototransf erase I Oder It codierl; und 

(d) eine nicht-translatierte 3'-Region eines Gens, das naturlich in Pflanzenzellen exprimiert wlrd, welche Re- 
gion fur eine Signalsequenz zur Polyadenyllerung von mRNA codiert; welcher Promoter in bezug auf die Struk- 
turcodiersequenz heterolog ist. 

40 2. Gen nach Anspruch 1 , worin die nicht-translatierte 3'-Region aus einem Gen von der Gruppe bestehend aus den 
Genen der T-DNA-Region von Agrobacterium tumefaciens ausgewahit isl. 

3. Gen nach Anspruch 1 , worin die nicht-translatierte 3'-Region von der Nopal in-Synthase von Agrobacterium tume- 
faciens ist. 



4. Chimares Gen. welches ein Polypeptid in Pflanzenzellen exprimieren kann, umfassend in Sequenz: 



(a) eine vollstandige Trans kript-Promotorregion, die aus Blumenkohl-Mosaikvirus Isoliert wurde; 

(b) eine nicht-translatierte 5'-Region; 
50 (c) eine Strukturcodiersequenz; 

(d) eine nicht-translatierte 3'-Region eines (Sens, das naturlich in Pflanzen exprimiert wird, welche Region fur 
eine Signalsequenz zur Polyadenyllerung von mRNA codiert; welche Strukturcodiersequenz in bezug auf die 
Promotorregion heterolog ist. 

55 5. Gen nach Anspruch 4, worin die nicht-translatierte 3'-Region von einem Nopalin-Synthase-Gen ist. 

6. Kultur von Mikroorganismen, identifiziert durch die ATCC-Eingangsnummer 39265. 
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Revendications 

1. Gdne chimdre capable d'exprimer un polypeptide qui est une ndomycine phosphotransferase dans des cellules 
v^g^tales conf^rant une antlblordslstance k la plante quand 11 est ins6r6 dans le genome d'une plante. comprenant 

s en sequence : 

(a) une region promotrice d'un gdne d'une petite sous-unit6 de la ribulose-1 ,5-bis-phosphate carboxylase; 

(b) une region 5' non tradulte; 

(c) une sequence structurale codante, codant pour la ndomycine phosphotransferase I ou 11; et 

10 (d) une region 3' non tradulte d'un gene exprlme naturellement dans les cellules vdgetales, ladite region codant 

pour une sequence-signal pour la polyaddnylation d'ARN messager; ledit promoteur 6tant h^tdrologue par 
rapport k la sequence structurale codante. 

2. Gdne selon la revendication 1 . dans lequel la region 3' non tradulte est un g^ne choisi dans un groupe constitue 
15 des gfenes de la region de TADN-T d'Agrobacterium tumefaciens. 

3. G^ne selon la revendication 1, dans lequel la region 3' non tradulte provient du gene de la nopaitne synthetase 
d'Agrobacterium tumefaciens. 

20 4. G6ne chimdre capable d'exprimer un polypeptide dans des cellules vdgetales comprenant. en sequence : 

(a) une region promotrice de transcription complete, isoiee du virus de la mosaique du chou-fleur; 

(b) une region 5' non traduite; 

(c) une sequence structurale codante; et 

25 (d) une region 3' non traduite d'un gene exprime naturellement dans les plantes; ladite region codant pour 

une sequence-signal pour la polyadenylation de I'ARN messager et ladite sequence structurale codante etant 
heterologue par rapport a ladite region promotrice. 
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55 



5. Gene de la revendication 4, dans lequel le region 3' non traduite vient d'un gene de la nopaline synthetase. 

6. Culture de micro-organ Ismes identifies par le numero d'accession ATCC 39265. 
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— GCTAG j< 



Xbol 



MIX.UGATE, 
TRANSFORM 
SELECT Ampt 
CELLS 



PARTIAL NPT II 

STRUCTURAL 

SEQUENCE 



89111 



DIGEST Mbo I 
PURIFY 194 bp 
FRAGMENT 



FIG.30. 




Xbo I --—SYNTHETIC LINKER 
Mbol 



Pst I 



Xhol 
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P»tl 



Xhol 




OIGeST WITH Xbol 
FILL-IN WITH KLENOW 
POLYMERASE-f 4 dNTPs 

DIGEST WITH Pst I 
PURIFY 3.6 Kb 
FRAGMENT 




Pst I 



Bom HI 



EcoRI 
Hind III 



DIGEST WITH HilMl III 
FILL-IN WITH KLENOW 
POUT MERASE4'4d NTPs 

DIGEST WITH Pst I 
PURIFY llOObp 
FRAGMENT 



Hind IIKBLUNT) 
Bam HI 



3' PORTION OF 
NPT II STRUCTURAL 
SEQUENCE 



NOS 3' 
REGION 



Xhol 
(BLUNT) 



FIG. 31 



MIX.LIGATE. 
TRANSFORM.SELECT 
Am pR CELLS 



Pst I 




BomHI 



EeoRI 
Hind III 
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Hind III 



Sod 
Mbol 




B9I II 



Xbo( 



Hind III 



OIQeST 
Hind III 
PURIFY 
476 bp 
FRAGMENT 



Hind III 



I9S PROMOTER 
5' REGIONS V DIGEST Mbol 

.PURIFY 495 bp 
\FRA6MENT 
Hind III \ Mbol 
^ ^ Soc 

^NOPAUNE 
SYNTHASE 




SomHI 

NOS 
POLV-A 
SIGNAL 

DIGEST EeoRI 
89111 

PURIFY 1250 bp 
FRAGMENT 




EcoRI 



Bom HI 



MODIFIED NPT II NOS 3' 
STRUCTURAL NON>TRANS- 
SEQUENCE ^LATED REGION 



Hind II 



FIG. 32. 



MIX, LIGATE. TRANSFORM, 
SELECT Spc« CELLS 



NOPALINE 
SYNTHASE 



Hind ill 



Soc i 



Mbol 



pMON 155 



19 S 

PROMOTER 

'REGIONS 



SpCy 



^Str 



NOS 



MODIFIED 
STRUCT 



Bgi II 



NPT 



'Xboi 



EcoRI 



-Bom HI 
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Soil 



POUY-A SITE 



Sal I 




DIGEST WITH 
Bglll 

PURIFY 1200 bp 
FRAGMENT 



CoMV 
ONA 



Bgi II 

bz 



EcoRl 



Co MV(32S) PROMOTER, 
5' LEADER REGION 



CcoRI 



SomHI 



I pSHL 72 I 



DIGEST BomHI. 
BqIII 



Bfllll/MIX. 

o u. ^ UIGATE, 
9ammy ^TRANSFORM, 
B9I II SELECT 
Amp** 
CELLS 




DIGEST WITH 

EcoRI*EcoRTr 
PURIFY 190 bp 
FRAGMENT 



MIX, U6ATE.TRANSF0RM, SELECT 
^ ^mp« CELLS „ 

DIGEST EcoRT^ 

PURIFY 3.1 Kb /XAmpR^^^ 
FRAGMENT // 32 S 

' ' PROMOTER, . 
5' NON -TRANSLATED 

REGIONS 
pMON 61 



FIG. 33. 
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EeoRI 




DIGEST 
EeoRI, 
Bgill 



Bgl II 



EcoRl 



Mbol 



PURIFY 
1550 bp 
FRAGMENT 



Mbol 



3 



EMRT 

=i 



EeoR7 



Bglll 



Bgl II 



CoMVSZSPROMOTERt 
5* NON -TRANSLATED 
REGIONS 

DIGEST Mbol 
PURIFY 725 bp 
FRAGMENT 



Mbol EcoR7 Mbol 




FIG. 34. 



MIX.UGATE, 
TRANSFORM, SELECT 

Ampl^CELLS 



Bgl II 



Amp" 

CqMV vv\ 
32S 
PROMOTER,\V 
5' NON- |1 

PM0NI25 TRANS-// 
REGIONS 



'EcoR3r 



Bgl II 
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Bgl II 




ECQR7 



DIGEST Bam HI 
FILL-IN WITH KLENOW 
POLYMERASE •*> 4d NTPs 



EcoRX 



DIGEST EcoRT 
PURIFY 3.1 Kb 
FRAGMENT 



EtoRir 



Bgl II 



B9III 



DIGEST Bgl II 
PURIFY 725 bp 
FRAGMENT 

Alul 



Bgl 11 

d 




DIGEST EeoR7 
Alul 

PURIFY 190 bp 
FRAGMENT 

.Alul 



FIG. 35. 



MIX.LIGATE. 
TRANSFORM, SELECT 
Ampf* CELLS 



EcoRI EcoRX 




Bom HI 
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EeoRI 




EcoRI^Smol 
•Bom Hi 



Bom HI 




DIGEST Eeo Ri, 

Bom HI ^ 
FILL-IN ENDS WITH 

KLENOW POLYMERASE 
+ 4d MTP« 
PURIFY 380 bp 
FRAGMENT 



DIGEST Smo I, CAP 



MIX.LIGATE. 
TRANSFORM. SELECT 
Amp" CELLS 




Bom HI 



F/G. 36. 
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NOS POLY-A 
SIGNAL 

Bam HI 



DIGEST EeoRl.Bgl II 
PURIFY 1250 bp 
FRAGMENT 




NOS 3' 
NON- 
TRANS- 
LATED 
REGION 



Bgl II /BamHI 
JOINT 



Bom Hi/ Bglli 
JOINT 



Bam HI 



'^yEcoRI 

NOS 
POLY- A 
SIGNAL 



64 



